INTRODUCTION


This is the first volume of a five-volume course of higher mathematics
which has been studied by Soviet mathematicians, physicists and 
engineers for forty years. In the first two editions (1924, 1927), which 
were practically identical, this first volume was written jointly by 
J. D. Tamarkin and V. I. Smirnov, but on the title page of later 
editions, prepared without the late Professor Tamarkin's cognizance 
and deviating from the two earlier editions in many respects, Professor 
Smirnov’s name appears alone. 

Professor Tamarkin’s career and his contributions to both Russian 
and American mathematics are well known to British and American 
readers, but the achievements of Professor Smirnov are known to a 
more restricted circle. Vladimir Ivanovitch Smirnov, who was born 
in 1887, has had a distinguished career in research and teaching which 
fits him ideally for the writing of a comprehensive work of extensive 
proportions. His research has been mainly in the theory of functions
and of differential equations, but he has made valuable contributions
to applied mathematics and, in particular, to theoretical seismology.
All his work has been characterized by a broad scientific outlook,
and he has done more than any other Soviet mathematician to maintain
and strengthen the connections between mathematics and physics.
His pupils, among whom are numbered S. L. Sobolev, N. E. Kochin 
and J. A. Lappo-Danilevskii, have maintained this tradition of work- 
ing in both pure and applied mathematics, a tradition which Smirnov 
inherited from his teacher V. A. Steklov. 

Professor Smirnov’s teaching experience in the old Institute of 
Transport, in a technical high school, in the Physics Department of 
the Mathematics and Physics Faculty of the University of Lenin- 
grad, and as Director of the Theoretical Section of the Institute of 
Seismology, Moscow, led him to study the design of a special course 
of higher mathematics for physicists and engineers, a project in the 
course of which he received the counsel of his many physicist friends 
particularly V. A. Fock and T. V. Kravets. The five-volume set of 
which the present volume is the first is the outcome of that study. It 
is, of course, designed as a first course for pure mathematicians in the
topics considered as well as for students and research workers whose
main interest lies in the applications of mathematics. 

The whole work is notable not only for the wealth of the illustrations 
it draws from physics and technology to illuminate points in pure 
mathematics, but also for the clarity of the exposition. This has
already been recognized in the Soviet Union, both by the esteem in which
the author’s work is held by academic teachers and by the award in 1947
of the State Prize (previously called the Stalin Prize) to the author for
this work. It is to be hoped that through Mr. Brown’s translation
its merits will become just as well known in the English-speaking 
world. 

The present volume is an introduction to calculus and to the prin- 
ciples of mathematical analysis including some introductory material 
on functions of several variables as well as on functions of a single 
variable. As well as providing the material necessary for the under- 
standing of the methods of mathematical physics it is an excellent 
introduction to these subjects for students of pure mathematics. 


I. N. SNEDDON 


PREFACE TO THE EIGHTH RUSSIAN EDITION 


The present edition differs very considerably from the last. The
material relating to analytic geometry has been excluded, and the 
remaining material has been rearranged as a result. In particular, 
applications of the differential calculus to geometry are now to be 
found collected in § 7 (Chapter II). A chapter has been added which
was previously the first of Volume II, dealing with complex numbers, 
the basic properties of integral polynomials, and the systematic 
integration of functions. 

Further substantial additions must be mentioned, apart from the 
various minor additions and modifications to the text. In view of 
the fact that quite subtle and difficult problems of higher analysis 
are encountered in later volumes, it was thought useful to give the 
theory of irrational numbers, and its use in proving tests for the 
existence of limits and the properties of continuous functions, at the 
end of § 2 (Chapter I) after the theory of limits. A rigorous definition 
and study of the properties of the elementary function is also to be 
found there. The proof of the existence of implicit functions is included 
in Chapter V, dealing with functions of several variables. 

The text is arranged so that the large type can be read independently. 
The small type sections contain examples, some additional particular 
problems, all the theoretical material referred to above, and the final 
section of Chapter IV, which deals with theory of a more difficult kind. 

My sincere thanks are due to Professor G. M. Fikhtengol'ts for a 
number of valuable suggestions regarding the text, which I have 
incorporated during the final revision of the book. 


PREFACE TO THE SIXTEENTH RUSSIAN EDITION 


The basic text and plan of the book have remained unchanged in the
present edition, though there are a number of alterations due to the 
requirements of accuracy and completeness. This refers especially to 
applications of the differential and integral calculus to geometry. 


V. SMIRNOV 



CHAPTER I


FUNCTIONAL RELATIONSHIPS 
AND THE THEORY OF LIMITS 


§ 1. Variables 


1. Magnitude and its measurement. Mathematical analysis has a
fundamental importance for exact science; unlike the other sciences, 
each of which has an interest only in some limited aspect of the 
world around us, mathematics is concerned with the most general 
properties inherent in all phenomena that are open to scientific 
investigation. 

One of the fundamental concepts is that of magnitude and its 
measurement. It is characteristic of a magnitude that it can be meas- 
ured, i.e. it can be compared in one way or another with some specific 
magnitude of the sort which is accepted as the unit of measurement. 
The process of comparison itself depends on the nature of the magnitude 
in question and is called measurement. Measurement results in an 
abstract number being obtained, expressing the ratio of the observed 
magnitude to the magnitude accepted as the unit of measurement. 

Every law of nature gives us a correlation between magnitudes, 
or more exactly, between numbers expressing these magnitudes. 
It is precisely the object of mathematics to study numbers and the 
various correlations between them, independently of the concrete 
nature of the magnitudes and laws which lead us to these numbers 
and correlations.

Thus, every magnitude is related by its measurement to an abstract 
number. This number depends essentially, however, on the unit 
assumed for the measurement, or on the scale. On increasing this 
unit, the number measuring a given magnitude decreases, and con- 
versely, the number increases on decreasing the unit. 

The choice of scale is governed by the character of the magnitude 
concerned and by the circumstances in which the measurements are 
carried out. The size of the scale used for measuring one and the same 
magnitude can vary within the widest possible limits; for instance,
in measuring length in accurate optical studies the accepted unit of
length is an Angstrom (one ten-millionth of a millimetre, 10⁻¹⁰ m);
whereas use is made in astronomy of a unit of length called a light- 
year, i.e. the distance travelled by light in the course of a year (light 
travels approximately 300,000 km in one second). 
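
The dependence of the measuring number on the unit can be illustrated by a short computation. The following Python sketch is not part of the original text: the function name `measure` is hypothetical, and the figures used (the speed of light as quoted above, a year of 365 days) are rounded assumptions.

```python
# Illustrative sketch: the number measuring a fixed length depends on the unit.
ANGSTROM_M = 1e-10                      # one Angstrom in metres
SECONDS_PER_YEAR = 365 * 24 * 3600      # assumed: a year of 365 days
LIGHT_YEAR_M = 300_000 * 1000 * SECONDS_PER_YEAR  # approximate light-year in metres

def measure(length_m, unit_m):
    """Return the abstract number: the ratio of the magnitude to the unit."""
    return length_m / unit_m

length = 1.0  # a length of one metre
print(measure(length, ANGSTROM_M))    # a very large number, the unit being small
print(measure(length, 1.0))           # 1.0
print(measure(length, LIGHT_YEAR_M))  # a very small number, the unit being large
```

On increasing the unit the number decreases, and conversely, exactly as stated above.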


2. Number. The number which is obtained as a result of measurement 
may be integral (if the unit goes an integral number of times into the 
magnitude concerned), fractional (if another unit exists, which goes an 
integral number of times both into the measured magnitude and into 
the unit previously chosen — or in short, when the measured magni- 
tude is commensurable with the unit of measurement) and finally, 
irrational (when no such common measure exists, i.e. the given magni- 
tude proves incommensurable with the unit of measurement). 

It is shown in elementary geometry, for instance, that the diagonal 
of a square is incommensurable with its side, so that, if we measure 
the diagonal of a square using the length of side as unit, the number 
√2 obtained by measurement is irrational. The number π is similarly
irrational, obtained on measuring the circumference of a circle, the 
diameter of which is taken as unit. 

Reference can usefully be made to decimal fractions, in order to 
understand the idea of irrational numbers. As is known from arith- 
metic, every rational number can be represented in the form of either 
a finite or an infinite decimal fraction, the infinite fraction being peri- 
odic in the latter case (simple periodic or compound periodic). For 
instance, on carrying out division of the numerator by the denominator 
in accordance with the rule for division into decimal fractions, we 
obtain: 


$$\frac{5}{33} = 0.151515\ldots = 0.(15),$$

$$\frac{5}{18} = 0.2777\ldots = 0.2(7).$$


Conversely, as is known from arithmetic, every periodic decimal 
fraction expresses a rational number. 
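
The rule of division into decimal fractions can be sketched in Python as follows. This is only an illustration under our own naming (`decimal_expansion` is a hypothetical helper, not taken from the text): the division is carried on until a remainder repeats, which is the moment at which the period begins again. It is applied to the fractions 5/33 and 5/18, whose expansions are 0.1515... and 0.2777...

```python
def decimal_expansion(numerator, denominator):
    """Long division of a positive fraction.

    Returns (integer_part, non_periodic_digits, periodic_digits) as strings;
    periodic_digits is empty when the fraction is finite.
    """
    integer_part, remainder = divmod(numerator, denominator)
    digits = []
    seen = {}  # remainder -> position at which it first appeared
    while remainder != 0 and remainder not in seen:
        seen[remainder] = len(digits)
        remainder *= 10
        digit, remainder = divmod(remainder, denominator)
        digits.append(str(digit))
    if remainder == 0:
        return str(integer_part), "".join(digits), ""
    start = seen[remainder]  # the period begins where this remainder first occurred
    return str(integer_part), "".join(digits[:start]), "".join(digits[start:])

print(decimal_expansion(5, 33))  # ('0', '', '15')
print(decimal_expansion(5, 18))  # ('0', '2', '7')
print(decimal_expansion(3, 8))   # ('0', '375', '')
```

Since only a finite number of different remainders is possible, a remainder must eventually repeat or vanish, which is why a rational number always gives a finite or a periodic fraction.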

In measuring a magnitude, incommensurable with the unit taken, 
we can first reckon how many times a full unit goes into the measured 
magnitude, then how many times a tenth of a unit goes into the re- 
mainder obtained, then how many times a hundredth of a unit goes into 
the new remainder and so on. Measurement of a magnitude, incommensurable
with the unit, will thus lead to the formation of an infinite
non-periodic decimal fraction. An infinite fraction of this sort corres- 
ponds to every irrational number, and conversely, to every infinite 
non-periodic decimal fraction there corresponds a certain irrational 
number. If only a few of the first decimal places are retained in this 
infinite decimal fraction, an approximate value is obtained below 
the irrational number represented by this fraction. Thus, for example, 
on extracting the square root in accordance with the usual rule to 
the third decimal place, we obtain: 


$$\sqrt{2} = 1.414\ldots$$


The numbers 1.414 and 1.415 are approximate values of √2 with
an accuracy of one-thousandth, below and above.
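
The approximate values below and above can be obtained by a short computation in whole numbers. The sketch below is an illustration only; the name `sqrt_bounds` is hypothetical, and the function is meant for numbers n that are not perfect squares.

```python
from math import isqrt

def sqrt_bounds(n, places):
    """Approximations of sqrt(n) below and above, to `places` decimal places.

    Intended for n that is not a perfect square.
    """
    scale = 10 ** places
    below = isqrt(n * scale * scale)  # integer part of sqrt(n) * 10**places
    return below / scale, (below + 1) / scale

low, high = sqrt_bounds(2, 3)
print(low, high)  # 1.414 1.415
```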

Decimal places can be used for comparing the sizes of irrational 
numbers with each other, and with rational numbers. 

Magnitudes of different signs, positive and negative, have to be 
considered in many cases (temperatures above and below 0°, positive 
and negative velocities of displacement along a line, and so on). Such 
magnitudes are expressed by corresponding positive and negative 
numbers. If a and b are positive numbers and a < b, then −a > −b,
and any positive number, including zero, is greater than any negative 
number. 

All rational and irrational numbers are arranged in a certain definite 
order, according to their magnitudes. All these numbers form the 
aggregate of real numbers. 

We shall note one circumstance in connection with the represen- 
tation of real numbers by decimal fractions. We can write an infinite 
decimal fraction with nine recurring in place of any given finite decimal 
fraction. For example: 3.16 = 3.1599... If finite decimal fractions
are not used, an accurate one-to-one correspondence is then obtained 
between real numbers and infinite decimal fractions, i.e. to every 
real number, except zero, there corresponds a definite infinite decimal 
fraction and to every infinite decimal fraction there corresponds a 
definite real number. Negative numbers can be associated with cor- 
responding infinite decimal fractions with the minus sign in front. 
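
That 3.16 and 3.1599... denote the same number can be made plausible by exact arithmetic with fractions. In the following sketch (our own illustration; `truncated_nines` is a hypothetical name) the difference between 3.16 and the fraction cut off after k nines is computed exactly.

```python
from fractions import Fraction

def truncated_nines(k):
    """Exact value of 3.15 followed by k nines; k = 2 gives 3.1599."""
    value = Fraction(315, 100)
    for i in range(1, k + 1):
        value += Fraction(9, 10 ** (2 + i))
    return value

for k in (1, 2, 5):
    gap = Fraction(316, 100) - truncated_nines(k)
    print(k, gap)  # the gap equals 1/10**(k + 2)
```

The gap can be made smaller than any given positive number by taking k large enough.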

The four primary operations can be carried out in the domain of real 
numbers, except division by zero. The root of odd degree of any given 
real number always has one specific value. The root of even degree 
of a positive number has two values, which differ only in sign. The root 
of even degree of a negative real number has no meaning in the domain
of real numbers. The rigorous theory of real numbers and the opera- 
tions on them is given later in small type in [40]. 

The number expressing a given magnitude is called its arithmetic or 
absolute value when associated with the + sign. The absolute value of 
the magnitude expressed by the number a, or in other words, the 
absolute value of the number a, is denoted by the symbol | a |. Thus 
we have: 


| a | = a, if a is a positive number,
| a | = −a, if a is a negative number.


It can easily be shown that the absolute value of the sum |a + b | 
is equal to the sum of the absolute values of the parts, |a |+ |b|, 
only if the parts have the same sign; otherwise, it will be less, so that 
we have: 


|a + b| ≤ |a| + |b|.


For example, the absolute value of the sum of the numbers (+3)
and (—7) is equal to four, but the sum of the absolute values of the 
parts is equal to ten. 

Similarly, it can be shown that 


|a − b| ≥ |a| − |b|


on the assumption that |a| > |b|.

The absolute value of the product of any number of factors is equal 
to the product of the absolute values of these factors, and the absolute 
value of a quotient is equal to the quotient of the absolute values of 
numerator and denominator, i.e.: 


$$|abc| = |a|\,|b|\,|c| \quad\text{and}\quad \left|\frac{a}{b}\right| = \frac{|a|}{|b|}.$$
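
These rules are easily checked on particular numbers. The sketch below is an illustration only (the function name is hypothetical); whole numbers are used so that the comparisons are exact.

```python
def check_absolute_value_rules(a, b):
    """Return True if the rules quoted in the text hold for the pair (a, b)."""
    sum_rule = abs(a + b) <= abs(a) + abs(b)
    difference_rule = abs(a - b) >= abs(a) - abs(b)
    product_rule = abs(a * b) == abs(a) * abs(b)
    return sum_rule and difference_rule and product_rule

print(abs(3 + (-7)), abs(3) + abs(-7))    # 4 10
print(check_absolute_value_rules(3, -7))  # True
```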

3. Constants and variables. The magnitudes studied in mathematics 
are divided into two classes: constants and variables. 

A magnitude is called a constant when it retains the same (invariable) 
value in a given investigation; a magnitude is a variable when, for 
one reason or another, it can assume different values in a given investi- 
gation. 

It is evident from these definitions that the concepts of constant 
and variable are largely a matter of convention and depend on the 
circumstances in which the given phenomenon is studied. A magnitude 
that may be considered as a constant under certain conditions can
become a variable under different conditions, and conversely.

For instance, it is important to know, when measuring the weights 
of bodies, whether the weighing operations are carried out at the same 
point of the earth's surface, or at different points: if the measurements 
are made at the same point, the acceleration due to gravity, on which 
the weight depends, will remain constant, and differences in weight 
between different bodies will depend only on their masses. On the 
other hand, if the measurements are made at different points on the 
earth's surface, the acceleration due to gravity cannot be looked on 
as a constant, since the centrifugal force due to the rotation of the 
earth must be considered. As a result of this factor, the same 
body will weigh less at the equator than at the poles, as can be ob- 
served if a spring-balance is used, instead of a lever-balance. 

Similarly, the length of the rods used in the construction of some 
technical device can be looked on as invariable for the purposes of 
rough calculation. When greater accuracy is needed, so that regard is 
taken of the effect of temperature on the measurement, the length of 
a rod becomes variable, with the natural result that all calculations 
become more complex. 


4. Interval. The character of the change of a variable can be ex- 
tremely diverse. A variable can assume either all possible real values, 
without limits (time t, for example, calculated from some definite 
initial moment, can assume all possible values, both positive and 
negative), or its values are limited by certain inequalities (absolute 
temperature T°, for example, must be greater than −273 °C); or
finally, a variable can assume only certain, and not all possible, types 
of value (only integral, in the case of the population figure for a given 
year or for the number of molecules in a given volume of gas, or only 
commensurable with a given unit and so on). 

We shall note some of the most common ways in which variables 
change in theory and practice. 

If the variable x can assume every real value permitted by the 
condition a ≤ x ≤ b, where a and b are fixed real numbers, we say
that x varies in the interval (a, b). Such an interval, including its ends,
is sometimes referred to as a closed interval. If the variable x can 
assume all values in the interval (a, b) except its ends, i.e. a < x < b, 
we say that x varies inside the interval (a, b). Such an interval, with 
its ends excluded, is referred to as an open interval. Furthermore, the 
domain of variation of x can be an interval, closed at one side and
open on the other: a < x ≤ b or a ≤ x < b.

If the domain of variation of x is defined by a ≤ x, we say that x
varies in the interval (a, +∞), which is closed on the left and open
on the right. Similarly, if x ≤ b, we have the interval (−∞, b), open
on the left and closed on the right. If x can assume any real value, we
say that x varies in the interval (−∞, +∞), open on both sides.
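
The different kinds of interval may be sketched in Python as follows. The class `Interval` and its field names are our own assumptions, introduced for illustration only; an infinite end is represented by `math.inf` and is always taken as open.

```python
from dataclasses import dataclass
import math

@dataclass(frozen=True)
class Interval:
    """Interval with ends a, b; each end is either included (closed) or excluded (open)."""
    a: float
    b: float
    closed_left: bool = True
    closed_right: bool = True

    def contains(self, x):
        left_ok = x >= self.a if self.closed_left else x > self.a
        right_ok = x <= self.b if self.closed_right else x < self.b
        return left_ok and right_ok

closed = Interval(0, 1)                         # 0 <= x <= 1
open_ = Interval(0, 1, False, False)            # 0 < x < 1
half_line = Interval(0, math.inf, True, False)  # 0 <= x, closed on the left only

print(closed.contains(1), open_.contains(1), half_line.contains(1e9))  # True False True
```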


5. The concept of function. We are concerned in most applications 
not with one variable, but with several variables at once. 

Let us take the example of a certain quantity of air, say 1 kg; 
the variables defining its state are: its pressure p (kg/m²), the volume
v (m³) which it occupies, and its temperature T °C. Let us assume for the
moment that the temperature of the air is maintained at 0 °C; the
number T is then a constant, equal to zero. The only remaining
variables are p and v. If the pressure p changes, then the volume v
changes; for example, if the air is compressed, the volume decreases.
We can change p arbitrarily (at least within the limits technically
attainable), in which case we can refer to p as an independent variable;
for every pressure p, there is evidently a completely defined volume. 
There must thus be a law which enables the corresponding volume 
v to be found for every value of p. This is, of course, Boyle's law, 
which says that the volume occupied by a gas at constant tem- 
perature is inversely proportional to the pressure. 

Applying this law to our kilogram of air, the relationship between 
v and p can be put in the form of an equation: 


$$v = \frac{273 \times 29.27}{p}.$$


The variable v is in this case called a function of the independent 
variable p. 
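
The dependence of v on p, namely v = 273 × 29.27 / p, can be written as a function in the programming sense. The sketch below is an illustration only; the name `volume` and the sample pressures are our own choices.

```python
def volume(p):
    """Volume v (m^3) of 1 kg of air at 0 degrees C for a pressure p (kg/m^2)."""
    if p <= 0:
        raise ValueError("the pressure must be positive")
    return 273 * 29.27 / p

# To every value of the independent variable p there corresponds a definite v.
for p in (5000.0, 10000.0, 20000.0):
    print(p, volume(p))
```

Doubling the pressure halves the volume, in accordance with Boyle's law.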

Turning from this particular example, we can say that, theoretically 
speaking, an independent variable is characterized by a large number 
of possible values, its value being any one chosen arbitrarily from all 
these possible values. The independent variable x, for example, can 
have a set of values consisting of the interval (a, b), or the interior
of this interval, i.e. the independent variable x can take any
value satisfying the condition a ≤ x ≤ b, or a < x < b. It might be
the case that x takes any integral value, etc. In the example quoted 
above, p had the role of independent variable, and the volume v was 
a function of p. We shall now define a function theoretically. 

DEFINITION. A quantity y is called a function of the independent
variable x, if for any given value of x (from all its possible values) there 
corresponds a definite value of y. 

Thus, if y is a function of x, defined in the interval (a, b), this means 
that there is a corresponding definite value of y for any value of x 
from this interval. 

It is mostly just a matter of convenience which of two magnitudes, 
x or y, is to be taken as the independent variable. In our example, 
we could have changed the volume v arbitrarily and defined the 
pressure p each time, thus having v as the independent variable and
p as a function of v. We obtain an expression for p as a function of
the independent variable by solving the equation given above in
terms of p:

$$p = \frac{273 \times 29.27}{v}.$$


What has been said in regard to two variables is extended without 
difficulty to the case of any desired number of variables; and we can 
distinguish here between independent variables, and dependent variables 
or functions. 

Returning to our example, let us assume that the temperature is 
no longer 0°C, but can change. Boyle’s law must now be replaced by 
the more complex relationship of Clapeyron: 


pv = 29.27 (273 + T), 


which shows that, when studying the gaseous state, only two of the 
magnitudes p, v and T can be changed arbitrarily, the third being 
fully defined when the values of these two are given. We can take p 
and T as independent variables, for example, in which case v is a
function of them:

$$v = \frac{29.27\,(273 + T)}{p}.$$

Or similarly, v and T can be independent variables, and p a function 
of them. 
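
The choice of independent variables in the relation pv = 29.27 (273 + T) may be sketched as two functions of two variables each. The names `volume` and `pressure` and the sample values are assumptions made for illustration.

```python
R = 29.27  # the constant in pv = 29.27 (273 + T) for 1 kg of air

def volume(p, T):
    """v as a function of the independent variables p and T."""
    return R * (273 + T) / p

def pressure(v, T):
    """p as a function of the independent variables v and T."""
    return R * (273 + T) / v

v = volume(10000.0, 20.0)
print(v, pressure(v, 20.0))  # the second value recovers approximately 10000
```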


Let us take another example. The area S of a triangle is given in 
terms of the lengths of sides a, b, c, by the formula: 

$$S = \sqrt{p(p-a)(p-b)(p-c)},$$

where p is half the perimeter of the triangle:

$$p = \frac{a+b+c}{2}.$$

The sides a, b, c can vary arbitrarily, provided only that each side
is greater than the difference, and less than the sum, of the other two.
The variables a, b, c are thus independent variables restricted by certain
inequalities, whilst S is a function of them.
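
A function of three independent variables restricted by inequalities may be sketched as follows. The name `triangle_area` is hypothetical; the formula used is the one quoted above, the area being the square root of p(p − a)(p − b)(p − c), where p is half the perimeter.

```python
from math import sqrt

def triangle_area(a, b, c):
    """Area of a triangle with sides a, b, c by the formula quoted in the text."""
    # Each side must be less than the sum of the other two; these three
    # conditions also ensure that each side exceeds their difference.
    if not (a < b + c and b < a + c and c < a + b):
        raise ValueError("no triangle has these sides")
    p = (a + b + c) / 2  # half the perimeter
    return sqrt(p * (p - a) * (p - b) * (p - c))

print(triangle_area(3, 4, 5))  # 6.0
```

Values of a, b, c which violate the inequalities lie outside the domain of the function, and the sketch rejects them.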

We can also fix two sides, say a, b, and the area S of the triangle; 
using the formula: 


$$S = \tfrac{1}{2}\, ab \sin C,$$


where C is the angle between sides a, b; we can then find C. The magni- 
tudes a, b, S are the independent variables here, and C the function. 
The variables a, b, S must be restricted in this case by the condition: 


$$\sin C = \frac{2S}{ab} \le 1.$$


It may be noted that we obtain two values for C in this example, 
depending on whether we take the acute or obtuse angle, both of which 


have the same sine: 


$$\sin C = \frac{2S}{ab}.$$


We meet here the concept of a many-valued function, about which 
more will be said below. 
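
The two values of C can be computed from the relation ab sin C = 2S. The following sketch is an illustration only; the name `angles_from_area` and the sample values are our own assumptions.

```python
from math import asin, pi, sin

def angles_from_area(a, b, S):
    """Return the acute and the obtuse angle C (in radians) with a*b*sin(C) = 2*S."""
    s = 2 * S / (a * b)
    if not 0 < s <= 1:
        raise ValueError("the condition 2S/(ab) <= 1 is violated")
    acute = asin(s)
    return acute, pi - acute  # both angles have the same sine

C1, C2 = angles_from_area(3, 4, 3)  # here sin C = 1/2
print(C1, C2)                       # about pi/6 and 5*pi/6
```

When 2S = ab the two values coincide in the right angle.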


6. The analytic method of representing functional relationships. 
Every law of nature, connecting certain phenomena with others, 
establishes a functional relationship between magnitudes. 

There are many ways of representing a functional relationship, but 
the most important are the three following: (1) the analytic, (2) the
tabular, and (3) the graphical, or geometrical method. 

We say that a functional relationship between magnitudes, or more 
simply, a function, is represented analytically, if these magnitudes are 
connected with each other by equations. These equations contain the 
magnitudes, subject to the various mathematical operations: addition, 
subtraction, division, taking logarithms etc. We always arrive at an 
analytic representation of a function on studying a problem theoreti- 
cally: which means that, having established basic premises, we make 
use of mathematical analysis and obtain results in the form of mathe- 
matical formulae. For instance, in celestial mechanics, all the possible 
motions, positions and interactions of the heavenly bodies are deduced 
from a single basic law, that of universal gravitation.

If we have a direct expression for the function (i.e. the dependent
variable) in terms of mathematical operations on the independent
variables, we say that the function is given explicitly. The expression
for the volume v of a gas at constant temperature in terms of the
pressure is an example of an explicit function (of one independent
variable):

$$v = \frac{273 \times 29.27}{p}.$$


Similarly, the expression for the area S of a triangle in terms of the 
sides: 


$$S = \sqrt{p(p-a)(p-b)(p-c)}$$

is an example of an explicit function of three independent variables. 
Another example may be given of an explicit function of one indepen- 
dent variable: 
y = 2x² − 3x + 7. (1)


It is often inconvenient or impossible to write down the formula 
expressing a function in terms of the independent variables. We can 
write briefly instead: 

y = f(x).


This notation means that y is a function of the independent variable
x, and f symbolizes the dependence of y on x. Of course another letter
can be used in place of f. If we are considering several different
functions of x, several different letters must be used to express
symbolically the dependence on x:


f(x), F(x), φ(x), etc.


This notation is not only used when the function is given analyti- 
cally, but is used in the general case of functional dependence, as 
defined in [5]. 

Use is made of an analogous abbreviated notation for functions of 
several independent variables: 


v = F(x, y, z).


Here, v is a function of the variables x, y, z.

We obtain particular values of functions by giving the independent 
variables particular values and carrying out the operations indicated 
by the symbols f, F,... For example, the particular value of the 
function (1) for x = 1/2 is: 


$$y = 2 \cdot \left(\tfrac{1}{2}\right)^2 - 3 \cdot \tfrac{1}{2} + 7 = 6.$$
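
The computation of a particular value can be repeated with exact fractions. In this sketch we take the function (1) to be y = 2x² − 3x + 7; the Python name `f` merely mirrors the notation f(x).

```python
from fractions import Fraction

def f(x):
    """The function (1): y = 2x^2 - 3x + 7."""
    return 2 * x**2 - 3 * x + 7

print(f(Fraction(1, 2)))  # 6
```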

In general, the particular value for x = x₀ of some function f(x) is
denoted by f(x₀), and similarly for functions of several variables.

It is as well not to confuse the general concept of a function, which 
we gave in [5], with the concept of the analytic expression of y in 
terms of x. Reference is made in the general definition of a function 
only to some law, in accordance with which there is a corresponding 
definite value of y for any one of the set of possible values of the 
variable x. No analytic expression (formula) for y in terms of x is 
assumed here. It may be further remarked that a function can be 
defined by different analytic expressions for different portions of the 
domain of variation of the independent variable. For example, we can 
define the function y in the interval (0, 3) in the following way: 
y = x + 5 for 0 ≤ x ≤ 2, and y = 11 − 2x for 2 < x ≤ 3. A corresponding
value of y is in this case defined for any given x in the
interval (0, 3), which agrees with the definition of a function. 
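
A function defined by different expressions on different portions of its domain can be sketched as below. We take the two expressions to be x + 5 on the portion from 0 to 2 and 11 − 2x on the portion from 2 to 3; the placing of the ends of the two portions is our reading of the example.

```python
def y(x):
    """A function given by two different formulae on the interval (0, 3)."""
    if 0 <= x <= 2:
        return x + 5
    if 2 < x <= 3:
        return 11 - 2 * x
    raise ValueError("x lies outside the interval (0, 3)")

print(y(1), y(2), y(3))  # 6 7 5
```

Both expressions give the value 7 at x = 2, so that the two portions join without a jump.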


7. Implicit functions. A function is called implicit when we have 
no direct analytic expression for it in terms of the independent 
variables, but only an equation relating its values to those of the 
independent variables. For instance, if a variable y is related to a vari- 
able x by the equation: 

y³ − x² = 0,


y is an implicit function of the independent variable x; or on the other 
hand, x can be reckoned an implicit function of the independent 
variable y. 

An implicit function v of several independent variables x, y, z, ...
is defined in general by an equation:


F(x, y, z, ..., v) = 0.


We can only compute the value of this function by solving the
equation with respect to v, thus putting v in the form of an explicit
function of x, y, z, ...:


v = φ(x, y, z, ...).


In the above example, y is expressed in terms of x as: 


$$y = \sqrt[3]{x^2}.$$

However, it is by no means essential to solve the equation to obtain 
the various properties of the function v; more often than not, an 
implicit function can be studied quite well from the equation that
defines it, without attempting a solution.
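
Particular values of an implicit function can often be found numerically from the defining equation, without any explicit formula. The sketch below uses the method of halving the interval (bisection), which is not described in the text; the helper `solve_implicit` and the equation y⁵ + y − x = 0 are hypothetical examples chosen only for illustration.

```python
def solve_implicit(F, x, lo, hi, tol=1e-12):
    """Find y between lo and hi with F(x, y) = 0 by halving the interval.

    Assumes that F is continuous in y and that F(x, lo) and F(x, hi)
    do not have the same sign.
    """
    f_lo = F(x, lo)
    if f_lo * F(x, hi) > 0:
        raise ValueError("F does not change sign between lo and hi")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        f_mid = F(x, mid)
        if f_lo * f_mid <= 0:
            hi = mid                 # the root lies in the left half
        else:
            lo, f_lo = mid, f_mid    # the root lies in the right half
    return (lo + hi) / 2

# Hypothetical example (not from the text): y**5 + y - x = 0.
F = lambda x, y: y**5 + y - x
print(solve_implicit(F, 2.0, 0.0, 2.0))  # close to 1.0, since 1 + 1 - 2 = 0
```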

The volume v of a gas, for example, is an implicit function of the 
pressure p and temperature T, defined by the equation: 


pv = R(273 + T).


The angle C between sides a and b of a triangle of area S is an 
implicit function of a, b, S, defined by the equation: 


ab sin C = 2S.


8. The tabular method. The analytic method of representing a 
function is primarily used in theoretical work, when it is a matter of 
general laws. In practical work, which requires the computing of a 
large number of particular values of different functions, an analytic 
presentation is often unwieldy, since it means carrying out all the 
necessary calculations in each case. 

To avoid this, particular values of the most commonly occurring 
functions are computed for a large number of particular values of 
the independent variables, and tables are compiled. 

For example, there are tables of values of the functions: 


y = 1/x; √x; πx; πx²; log₁₀ x; log₁₀ sin x; log₁₀ cos x; etc.


Such functions are constantly met with in practical work. Very useful 
tables have also been compiled for more complicated functions, such 
as Bessel functions, elliptic functions etc. There are further tables 
for functions of more than one variable, the simplest example being 
the ordinary multiplication table, i.e. a table of values of the function 
z = xy for different integral values of x and y. 

Occasionally, the tables only give the function for particular values 
of the independent variables, adjacent to those values for which the 
function is required; in order to make it possible to use the tables in 
this case, various rules of interpolation exist; one such rule is given in 
school courses on using logarithmic tables (proportional parts). 
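
The rule of proportional parts amounts to linear interpolation between two adjacent entries of the table. The following sketch is an illustration only: the function `interpolate` is a hypothetical helper, and the small table of log₁₀ x is rounded to four figures in the manner of printed tables.

```python
import math

def interpolate(table, x):
    """Linear interpolation in a table of (x, y) pairs sorted by x."""
    for (x0, y0), (x1, y1) in zip(table, table[1:]):
        if x0 <= x <= x1:
            # Proportional part of the tabular difference y1 - y0.
            return y0 + (y1 - y0) * (x - x0) / (x1 - x0)
    raise ValueError("x lies outside the table")

# Four-figure values of log10 x at x = 2.0, 2.1, 2.2.
table = [(2.0, 0.3010), (2.1, 0.3222), (2.2, 0.3424)]
print(interpolate(table, 2.05))  # about 0.3116
print(math.log10(2.05))          # about 0.3118, for comparison
```

The small disagreement in the last figure comes from replacing the curve by a straight line between adjacent entries and from the rounding of the tabulated values.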

Tables have special importance when they represent a function, 
the analytic expression for which is unknown; this is the case when 
an experiment is carried out. Every experimental investigation aims 
at discovering hidden functional relationships, and experimental results 
are set in the form of a table, relating the different values of the mag- 
nitudes investigated in the experiment. 


9. The graphical method of representing numbers. Passing to the
graphical method of representing functional relationships, we shall 
begin with the representation of a single variable. 

Every number x can be represented by a certain interval. Having
settled once for all on the choice of the unit of length, it is sufficient 
to construct an interval, the length of which is exactly equal to the 
given number x. Thus, every magnitude can be represented geo- 
metrically by an interval, as well as expressed by a number. 


Fig. 1 


So as to be able to represent negative numbers, we agree to cut 
off intervals along the same straight line, thus attributing a definite 
direction to them (Fig. 1). We agree further to denote every interval 
by a symbol AB, the point A being called the origin, and the point B 
the end of the interval. 

If the direction from A to B coincides with the direction of the line, 
the interval represents a positive number; whilst if the direction from 
A to B is opposite to that of the line, it represents a negative number 
(A_1B_1 in Fig. 1). The absolute value of the number concerned is 
expressed by the length of the interval representing it, irrespective of 
direction. 

The length of the interval AB will be denoted by | AB |; if AB 
represents a number x, we shall simply write: 


x = AB; |x| = |AB|. 

We can make things more definite by agreeing once and for all to 
locate the origin of all intervals at a previously chosen point O of 
the line. Every interval OA, and hence the number x that it 
represents, will now be fully defined by the point A, its end (Fig. 2). 
Conversely, given the number x, we can define the magnitude and 
direction of the interval OA, and hence its end A. 

Fig. 2 

Thus, if we take a straight line X'X (axis) in a given direction, 
and mark out on it a point O (origin), a corresponding point A of this 
line will be defined for every real number x, the interval OA being measured 
by the number x. Conversely, for every point A of the axis there is a 
corresponding fully defined real number x, measuring OA. This number 
x is called the abscissa of the point A; if we want to denote that the point 
A has abscissa x, we shall write A(x). 

If the number x varies, the point A representing it moves on the 
axis. The concept of interval established above becomes easy to 
visualize with this graphical representation of the number x. If x varies in 
the interval a < x < b, the corresponding point on the axis X'X can 
be situated anywhere in the interval, the ends of which have abscissae 
a and b. 

If we were limited to rational numbers alone, there would be no 
abscissa corresponding to the point A when the interval OA proved 
incommensurable with the chosen unit, in other words, rational 
numbers alone do not fill in all the points of a straight line. This fill- 
ing is achieved by introducing irrational numbers. The proposition 
mentioned above is basic to the graphical representation of a single 
variable: a corresponding real number is defined for every point of 
the axis X’ X, and conversely, a corresponding point of the axis 
X’ X is defined for every real number. 

Let us take two points on the axis X'X: the point A_1 with abscissa 
x_1, and the point A_2 with abscissa x_2. The number x_1 will correspond to 
the interval OA_1, and the number x_2 to OA_2. Taking all the possible 
mutual arrangements of points A_1 and A_2, it is easy to see that the 
interval A_1A_2 corresponds to the number (x_2 − x_1), so that the length 
of the interval is equal to the absolute value of the difference (x_2 − x_1): 

|A_1A_2| = |x_2 − x_1|. 

If, for example, x_1 = −3 and x_2 = 7, the point A_1 lies to the left 
of O at a distance equal to 3, and point A_2 lies to the right of O at 
a distance equal to 7. The section A_1A_2 will have length 10 and will 
be directed along the axis X'X, i.e. corresponding to it we have the 
number 10 = 7 − (−3) = x_2 − x_1. We leave it to the reader to 
analyse other possible arrangements of the points A_1 and A_2. 


10. Coordinates. We saw above, that the position of a point on a 
line X’ X can be defined by a real number x. We now indicate the 
analogous method of defining the position of a point on a plane. 

We shall take two mutually perpendicular axes X'X and Y'Y 
on a plane, with their point of intersection O as origin on each (Fig. 3). 
The positive directions on the axes are shown by arrows. We have a 
real number, denoted by the letter x, corresponding to a point of the 
axis X'X. Similarly, a real number denoted by y corresponds to a 
point of the axis Y'Y. If specific values are assigned to x and y, 
points A and B are defined on axes X'X and Y'Y; knowing A and B, 
we can construct the point M as the intersection of lines parallel to 
the axes and passing through A and B. 
































To each pair of values x, y, there corresponds a single fully defined 
position of the point M on the plane of the figure. 

Conversely, to each point M of the plane there corresponds a fully 
defined pair of values of x, y, corresponding to the points at which 
lines through the point M parallel to the axes intersect the axes 
X'X and Y'Y. 

With the directions of the axes shown in Fig. 3, x is to be reckoned 
positive or negative, depending on whether A lies to the right or left 
of point O; similarly, y is positive or negative, depending on whether 
B lies above or below point O. 

The magnitudes x, y defining the position of point M in the plane, 
and defined in turn by point M, are called the coordinates of M. The 
axes X'X, Y'Y are called the coordinate axes, the plane of the figure 
is the plane of coordinates XOY, and the point O is the origin of co- 
ordinates. 

Magnitude x is called the abscissa, and y the ordinate, of the point M. 
We shall specify the point M by its coordinates by writing: M(x, y). 

This method of representation is called the method of rectangular 
coordinates. 

The signs of the coordinates of the point M when situated in 
different quadrants of the axes (I-IV) (Fig. 3) can be shown in a table: 

M | I | II | III | IV 
x | + | −  | −   | + 
y | + | +  | −   | − 





It is obvious that the coordinates x, y of M are equal to the 
distances of M from the coordinate axes, associated with the corre- 
sponding signs. 
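
The table of signs can be restated as a short function. The name `quadrant` and the convention of returning `None` for a point lying on an axis are our own choices for the sketch.

```python
def quadrant(x, y):
    """Return the quadrant (I-IV) of the point M(x, y), or None if M lies on an axis."""
    if x == 0 or y == 0:
        return None  # points on the coordinate axes belong to no quadrant
    if x > 0:
        return "I" if y > 0 else "IV"
    return "II" if y > 0 else "III"


for point in [(2, 3), (-2, 3), (-2, -3), (2, -3)]:
    print(point, quadrant(*point))
```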


11. Graphs. The equation of a curve. We return to x and y, 
representing the point M. Let x and y be connected by a functional relationship. 
This means that, on varying x (or y) arbitrarily, a corresponding value 
of y (or x) can be found each time. Every such pair of values x and y 
corresponds to a definite position of the point M on the plane XOY; 
when the values vary, the point M moves over the plane and thus 
traces out a certain curve (Fig. 4), which is called a graphical 
representation (or simply a graph or diagram) of the functional 
relationship concerned. 

If the relationship is given analytically as an equation in explicit form: 

y = f(x) 

or in implicit form: 

F(x, y) = 0, 

we call this the equation of the curve, whilst the curve is the graph of 
the equation. A curve and its equation are simply different expressions 
of the same functional relationship, i.e. all points, the coordinates of 
which satisfy the equation of a curve, lie on this curve, and conversely, 
the coordinates of all points lying on the curve satisfy its equation. 

If the equation of a curve is given, the curve itself can be constructed 
more or less accurately on a sheet of graph paper (more strictly, any 
desired number of points lying on the curve can be constructed); 
the more points are taken, the more evident becomes the shape of the 
curve. This method is called plotting a curve. 

The choice of scale is important in plotting curves. Different scales 
can be chosen for x and y. The plane is taken as a sheet of paper, 
ruled into squares or rectangles, depending on whether the scales of 
x and y are the same or different. It is assumed below that the scales 
of x and y are the same. 

The reader is recommended at this point to plot some curves of 
simple functions, and to vary the scales of x and y. 

The concepts introduced above of the coordinates of a point M, 
of the equation of a curve and the graph of an equation, establish an 
intimate connection between algebra and geometry. On the one hand, 
we can represent and study an analytic relationship by a visual, 
geometrical method; on the other hand, it becomes possible to solve 
geometrical problems with purely algebraic operations. This is the 
fundamental idea of analytic geometry, first developed by 
Descartes. 

In view of its extreme importance, we shall formulate again the 
facts that lie at the basis of analytic geometry. If we mark out two 
coordinate axes in a plane, every point of the plane corresponds to a 
pair of real numbers, the abscissa and ordinate of the point, and conver- 
sely, every pair of numbers corresponds to a definite point of the plane, 
the first number being its abscissa and the second its ordinate. A curve 
on the plane corresponds to a functional relationship between x and y, 
that is, to an equation containing x and y which is satisfied if, and only if, 
x and y can be replaced by the coordinates of some point of the curve. 
Conversely, an equation containing two variables x and y has a corres- 
ponding curve, made up of the points of the plane whose coordinates, 
when substituted for x and y, satisfy the equation. 

We now turn to the study of the graphs of simple functions. We 
again note that if we have a functional relationship given by an equa- 
tion in explicit or implicit form: 


y = f(x), or F(x, y) = 0, 


then the curve in the plane of axes X’X, Y’Y corresponding to this 
equation is called the graph of the equation or the graph of the function 
defined by the equation. The abscissae and ordinates of points of this 
curve give mutually corresponding values of the variables x and y, 
connected by the functional relationship. 


Graphs are drawn automatically in recording devices; the variable x 
is usually time; y is the magnitude, the variation of which with time 
interests us: for example, barometric pressure (barograph) or temperature 
(thermograph). The indicator which records the relationship between the 
volume and pressure of the gas enclosed in a steam or gas engine is worth 
mentioning. 


12. Linear functions. The simplest function, which at the same time 
has extremely important applications, is the polynomial of the first 
degree: 


y = ax + b, (2) 


where a and b are given constant numbers. We shall see that the graph 
of this function is a straight line. It is called a linear function. 
We shall first consider the case of b equal to zero. The function 
then has the form: 

y = ax. (3) 

This expresses the fact that the variable y is directly proportional to 
the variable x, the constant coefficient a being called the coefficient 
of proportionality. 

Fig. 5 

Turning to the figure (Fig. 5), we see 
that equation (3) expresses the following geometric property of the 
graph in question: whatever point M we take on it, the ratio of the 
ordinate y = NM of this point to its abscissa x = ON is equal to the 
constant a. Since, on the other hand, this ratio is equal to the tangent 
of the angle α between the segment OM and the axis OX, it is evident 
that the geometrical locus of M is a straight line, passing through the 
origin of coordinates O at an angle α (or π + α) to the axis OX. 

Angle α is reckoned counter-clockwise from the direction OX. 

The geometrical importance of the coefficient a in equation (3) is 
simultaneously revealed: a is the tangent of the angle α between the 
axis OX and the straight line corresponding to this equation, a being 
therefore called the slope of the straight line. It may be noted that 
if a is a negative number, the angle α is obtuse, and the 
corresponding line is as shown in Fig. 6. 

Let us now return to the general case of a linear function, viz. to 
equation (2). The ordinate y of the graph of this equation differs from 
the corresponding ordinate of the graph of equation (3) by the constant 
amount b. Thus, we immediately obtain the graph of equation (2), 
if the graph of equation (3) shown in Fig. 5 (for a > 0) is displaced 
parallel to the axis OY through a distance b: upwards for b positive, 
downwards for b negative. We obtain a straight line, parallel to the 
initial line, and cutting off a segment OM_0 = b on the axis OY (Fig. 7). 

Thus, the graph of function (2) is a straight line, the coefficient a 
being equal to the tangent of the angle that the line makes with the axis 
OX, and the constant term b equal to the segment cut off by the line on the 
axis OY, measured from the origin O. 

Conversely, given any straight line L, not parallel to the axis OY, 
its equation can easily be written in the form (2). In accordance 
with the above, it is sufficient to take the coefficient a equal to the 
tangent of the angle of inclination of this line to the axis OX, and b 
equal to the segment that it cuts off on the axis OY. 

Fig. 6    Fig. 7 

We shall note a particular case, which presents a well-known 
peculiarity. Let a = 0. Equation (2) gives for all x: 

y = b, (2_1) 

i.e. a "function" of x is obtained, such that its value b remains the 
same for all values of x. It is obvious that the graph of equation (2_1) 
is a straight line, parallel to the axis OX and displaced from it by a 
distance |b| (upwards for b > 0, and downwards for b < 0). To avoid 
special reservations, we shall sometimes say that equation (2_1) also 
defines a function of x. 


13. Increment. The basic property of a linear function. We shall 
establish one important new concept, with which we shall often be con- 
cerned in the study of functional relationships. 

The difference between the final and initial values of the independent 
variable x on transition from the initial value x_1 to the final value x_2 
is called the increment of x, equal to x_2 − x_1. The difference between the 
final and initial values of the function y = f(x) is called the corresponding 
increment of the function: 

y_2 − y_1 = f(x_2) − f(x_1). 

These increments are often denoted by: 

Δx = x_2 − x_1; Δy = y_2 − y_1. 


It may be noted here that the increment can be either positive 
or negative, and that the magnitude receiving the “increment” does 
not necessarily increase. 

It must be pointed out that the symbol Δx has to be regarded as 
a single entity in denoting the increment of x. 

We shall consider the case of a linear function, when 

y_1 = ax_1 + b and y_2 = ax_2 + b. 

Subtracting term by term, we have: 

y_2 − y_1 = a(x_2 − x_1) (4) 

or 

Δy = aΔx. 


This equality shows that the linear function y = ax + b has the 
property that the increment of the function (y_2 − y_1) is proportional 
to the increment of the independent variable (x_2 − x_1), the 
coefficient of proportionality being equal to a, i.e. to the slope 
of the graph of the function. 

Turning to the graph itself (Fig. 8), corresponding to the 
increment of the independent variable we have the segment 
M_1P = Δx = x_2 − x_1, and corresponding to the increment of the 
function, PM_2 = Δy = y_2 − y_1; and formula (4) follows at once from 
considering the triangle M_1PM_2. 

Fig. 8 

We shall now assume that a certain function has the above property 
of proportionality of the increments of the independent variable and 
of the function, expressed by (4). It follows from this formula that: 


y_2 = a(x_2 − x_1) + y_1 

or 

y_2 = ax_2 + (y_1 − ax_1). 

We shall take the initial values of the variables x_1 and y_1 as 
completely defined, and denote the difference (y_1 − ax_1) by the single 
letter b: 

y_2 = ax_2 + b. 

Since we can take any final values of the variables x_2 and y_2, we can 
simply write in place of the letters x_2 and y_2 the letters x and y, and 
rewrite the above equality in the form: 

y = ax + b, 


i.e. every function having the above property of proportionality of the 
increments is a linear function y = ax + b, where a is the coefficient 
of proportionality. 

A linear function and its graph, a straight line, can thus serve to 
represent any natural law in which the increments of the magnitudes 
concerned are proportional, as is very often the case. 
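
The argument above can be checked numerically. In this sketch the helper name `line_through` and the sample points are our own; the function recovers a as the ratio of the increments and b as y_1 − ax_1, exactly as in the derivation.

```python
def line_through(p1, p2):
    """Coefficients (a, b) of the linear function y = a*x + b through two points.

    a is the ratio of increments (y2 - y1) / (x2 - x1); b equals y1 - a*x1.
    The points must have different abscissae.
    """
    (x1, y1), (x2, y2) = p1, p2
    a = (y2 - y1) / (x2 - x1)
    b = y1 - a * x1
    return a, b


a, b = line_through((1, 5), (3, 9))

# For any other pair of points of the same line the increments
# stay in the same ratio a.
x3, x4 = -2.0, 10.0
dy = (a * x4 + b) - (a * x3 + b)
dx = x4 - x3
print(a, b, dy / dx)
```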


14. Graph of uniform motion. This is one of the most important 
applications, which gives a mechanical interpretation of the equation of a 
straight line and its coefficients. If the point P moves along a certain path 
(trajectory), its position is fully defined by the distance, measured in either 
direction along the trajectory from a given point A of it to the point P. This 
distance, i.e. the arc AP, is called the path traversed, and is denoted by the 
letter s; s can be both positive and negative, its values on one side of the 
initial point A being reckoned positive, and on the other side, negative. 

The path s traversed is a certain function of time t; taking time as the 
independent variable, we can draw a graph of the motion, i.e. a graph of the 
functional relationship (Fig. 9): 

s = f(t); 

this is not to be confused with the trajectory itself. 

The motion is called uniform if the path traversed by a point in any given 
interval of time is proportional to this interval, in other words, if the ratio 

(s_2 − s_1) / (t_2 − t_1) = Δs / Δt 

of the path traversed in the interval from t_1 to t_2 to the size of this 
interval, is a constant; this constant is called the velocity of the motion and 
is denoted by v. 

It is clear from the above that the equation of the graph of uniform 
motion has the form: 

s = vt + s_0; 

the graph itself is a straight line, the slope of which is equal to the 
velocity whilst the initial ordinate s_0 is the initial value of the path s 
traversed, i.e. the value of s at t = 0. 

Figure 10 shows the graph of the motion of a point P, moving with 
constant velocity v_1 in a positive direction from the instant 0 to the instant 
t_1 (acute angle with axis of t), then with higher constant velocity v_2 in the 
same direction (larger acute angle) to the instant t_2, then with negative 
constant velocity v_3 (in the opposite direction, obtuse angle) back to its 
initial position. In the case of several points, all moving in the same 
trajectory (for instance, when making up schedules of the movement of trains 
or trams), this graphical method is particularly suitable as a practical means 
of determining the encounters of the moving points, and in general for 
reviewing the movements as a whole (Fig. 11). 
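
The encounter of two points in uniform motion along the same trajectory corresponds to the intersection of two straight lines s = vt + s_0, and can also be found by calculation. The sketch below is an illustration only; the function name `meeting` and the numerical data (two trains approaching each other) are hypothetical.

```python
def meeting(v1, s1, v2, s2):
    """Instant t and position s at which two uniform motions meet.

    The motions are s = v1*t + s1 and s = v2*t + s2.
    Returns None when the velocities are equal (parallel graphs).
    """
    if v1 == v2:
        return None
    t = (s2 - s1) / (v1 - v2)
    return t, v1 * t + s1


# Hypothetical trains: one leaves the origin at 60 km/h, the other starts
# 100 km away and travels towards it at 40 km/h.
print(meeting(60, 0, -40, 100))
```

A negative value of t would mean that the graphs intersect before the initial instant, i.e. that the encounter lies in the past.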











15. Empirical formulae. The simplicity of constructing a straight line 
and of thus expressing a law of proportionality between the increment of 
a function and that of the independent variable makes a straight line graph 
an extremely convenient means of arriving at an empirical law, i.e. one 
following directly from the experimental data, without special theoretical 
investigation. 

We represent graphically on a sheet of millimetre paper the table of 
experimental results and thus obtain a series of points. If we wish to obtain an 
approximate empirical formula so as to study a functional relationship in 
the form of a linear function, we now draw a straight line, which, if it does 
not pass through all the points given (which, of course, is almost always 
an impossibility), is made at the least to pass between these points in such 
a way that as nearly the same number of points as possible appear on one 
side of the line as on the other, all the points being reasonably close to it. 
Error theory and observation theory study more accurate ways of drawing 
the line mentioned, and also of judging the degree of error arising with such 
an approximation. In less accurate investigations, however, such as in 
technology, the drawing of the empirical line is most simply carried out by the 
"taut string" method, the nature of the method being evident from the 
name. Having obtained the line, we use direct measurement to determine 
its equation: 

y = ax + b, 

which gives the required empirical formula. When obtaining this formula 
we must not lose sight of the fact that different scales are very often in use 
for the magnitudes x and y, i.e. lines with the same slope on the axes OX, 
OY, may represent different numbers. In this case, the slope a will not be 
equal to the tangent of the angle between the line and the axis OX, but 
will differ from this by a factor numerically equal to the ratio of the units 
of length used in representing magnitudes x and y. 

Example (Fig. 12). 

x | 0.212 | 0.451 | 0.530 | 0.708 | 0.901 | 1.120 | 1.341 | 1.520 | 1.738 | 1.871 
y | 3.721 |  ...  |  ...  |  ...  |  ...  |  ...  |  ...  |  ...  |  ...  | 4.350 

(Of the intermediate values of y, only 4.099 and 4.089 are legible.) 

The result is: 

y ≈ 0.375x + 3.65. 

(Here, and subsequently, the sign ≈ denotes approximately equal to.) 
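
Where a calculation is preferred to drawing the line by eye, the method of least squares is the usual choice. It is a different technique from the "taut string" method described above, and is shown here only as a sketch. The function name `fit_line` and the measurements are hypothetical; they are not the table of the example.

```python
def fit_line(xs, ys):
    """Least-squares straight line y = a*x + b through the points (xs[i], ys[i])."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared and mixed deviations from the mean values.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b


# Hypothetical measurements scattered about a straight line.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 6.8, 9.0]
a, b = fit_line(xs, ys)
print(f"y = {a:.3f} x + {b:.3f}")
```

Here a and b refer to the numerical values of x and y themselves, so that the question of different scales on the two axes does not arise.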


16. Parabola of the second degree. The linear function 

y = ax + b 

is a particular case of an integral function of the n-th degree or a 
polynomial of the n-th degree: 

y = a_0 x^n + a_1 x^{n-1} + ... + a_{n-1} x + a_n, 

the simplest case of which, after the linear function, is the polynomial 
of the second degree (n = 2): 

y = ax^2 + bx + c; 

the graph of this function is called a parabola of the second degree or 
simply a parabola. 

For the present, we shall consider only the simplest case of a 
parabola: 

y = ax^2. (5) 


This curve can easily be plotted. Figure 13 shows the curves y = x^2 
(a = 1) and y = −x^2 (a = −1). The curve corresponding to equation 
(5) is situated wholly above the axis OX for a > 0, and wholly below 
the axis OX for a < 0. The ordinate of this curve increases in absolute 
value when x increases in absolute value, the increase being the faster, 
the greater the absolute value of a. Figure 14 shows a series of graphs 
of the function (5) for different values of a, these values being marked 
in the figure against the corresponding parabolas. 

Equation (5) contains only x^2, and hence does not vary on changing 
x to −x, i.e. if a given point (x, y) lies on the parabola (5), the point 
(−x, y) also lies on it. The two points (x, y) and (−x, y) are evidently 
symmetrical relative to the axis OY, i.e. one of them is the mirror 
image of the other relative to this axis. Thus, if the right-hand 
portion of the plane is turned through 180° about the axis OY and 
combined with the left-hand portion, the part of the parabola lying 
to the right of the axis OY will coincide with the part of the parabola 
lying to the left of this axis. In other words, axis OY is an axis of 
symmetry of the parabola (5). 

The origin of coordinates is the lowest point of the curve for a > 0, 
and the highest point for a < 0, and is called the vertex of the parabola. 

The coefficient a is fully defined if one point M_0(x_0, y_0) of the parabola 
is given, not at the vertex, since we have then: 

y_0 = a x_0^2, a = y_0 / x_0^2, 

so that the equation of the parabola (5) now has the form: 

y = y_0 (x / x_0)^2. (6) 


There is an extremely simple method of constructing any desired 
number n of points of the parabola, given the vertex, the axis of symmetry and 
any point M_0 of it, not at the vertex. We divide the abscissa and ordinate 
of the given point M_0 into n equal parts (Fig. 15) and draw a pencil of lines 
from the origin to the points of division of the ordinate. The points of 
intersection of these lines with lines through the points of division of the 
abscissa and parallel to the axis OY are points of the parabola. 

Fig. 15 

In fact, we have by construction (Fig. 15): 

x_1 = x_0 (n − 1)/n, y_1' = y_0 (n − 1)/n, 

y_1 = y_1' (n − 1)/n = y_0 [(n − 1)/n]^2 = y_0 (x_1 / x_0)^2, 

i.e. from (6), the point M_1(x_1, y_1) also lies on the parabola. The proof is 
similar for the remaining points. 
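
The construction can be imitated by calculation. This is a sketch under our own naming (`parabola_points`) and with an arbitrarily chosen point M_0: the k-th line of the pencil joins the origin to the k-th point of division of the ordinate of M_0, and is intersected with the vertical through the k-th point of division of the abscissa.

```python
def parabola_points(x0, y0, n):
    """Points of the parabola through the vertex O and M0(x0, y0), built as in the text."""
    points = []
    for k in range(1, n + 1):
        x_k = x0 * k / n        # k-th point of division of the abscissa
        y_div = y0 * k / n      # k-th point of division of the ordinate of M0
        slope = y_div / x0      # line from the origin to the point (x0, y_div)
        points.append((x_k, slope * x_k))
    return points


x0, y0 = 2.0, 8.0
for x, y in parabola_points(x0, y0, 4):
    # Compare with equation (6): y = y0 * (x / x0)^2.
    print(x, y, y0 * (x / x0) ** 2)
```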

If we have two functions: 

y = f_1(x) and y = f_2(x) 

and their corresponding graphs, the coordinates of the points of 
intersection of the graphs satisfy both the equations, i.e. the 
abscissae of the points of intersection are solutions of the equation: 

f_1(x) = f_2(x). 

This fact can readily be used to solve a quadratic equation 
approximately. Having constructed as accurately as possible the graph of 
the parabola 

y = x^2 (6_1) 

on a piece of millimetre graph paper we can now find the roots of the 
quadratic equation 

x^2 = px + q (7) 

as the abscissae of the points of intersection of the parabola (6_1) and 
the straight line y = px + q. Three cases are shown in Fig. 16, of 
two points of intersection, of one point (tangent to the parabola), 
and of no point of intersection. 
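
The three cases correspond to the sign of the discriminant of equation (7), written as x^2 − px − q = 0. The following sketch, with the function name `intersections` chosen by us, returns the abscissae of the points of intersection that the graphical method gives approximately.

```python
import math


def intersections(p, q):
    """Abscissae of the common points of the parabola y = x^2 and the line y = p*x + q."""
    disc = p * p + 4 * q          # discriminant of x^2 - p*x - q = 0
    if disc < 0:
        return []                 # the line does not meet the parabola
    if disc == 0:
        return [p / 2]            # the line touches the parabola
    root = math.sqrt(disc)
    return [(p - root) / 2, (p + root) / 2]


print(intersections(1, 2))    # two points of intersection
print(intersections(2, -1))   # tangent line
print(intersections(0, -1))   # no point of intersection
```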


17. Parabola of the third degree. The third degree polynomial: 

y = ax^3 + bx^2 + cx + d, 

has a graph in the form of a curve called a parabola of the third degree. 
We shall consider the simplest case of this curve: 

y = ax^3. (8) 

For a positive, the signs of x and y are the same; for a negative, 
they are different. In the first case, the curve lies in the first and 
third quadrants of the coordinate axes, and in the second case, it lies 
in the second and fourth quadrants. Figure 17 illustrates the form of 
the curve for various values of a. 

If x and y change simultaneously to (−x) and (−y), both sides 
of equation (8) change sign, and the equation is not essentially altered, 
i.e. if the point (x, y) lies on the curve (8), the point (−x, −y) also 
lies on this curve. The points (x, y) and (−x, −y) are evidently 
symmetrical relative to the origin O, i.e. the line joining them is bisected 
at O. It follows from this that every chord of the curve (8) that passes 
through the origin of coordinates O is bisected there. In other words: 
the origin of coordinates O is the centre of the curve (8). 

A further particular case of a third degree parabola will be 
mentioned: 

y = ax^3 + cx. (9) 

The right-hand side of this equation is the sum of two terms, and 
consequently, to construct the curve, it is sufficient to draw the 
straight line 

y = cx (10) 

and take the sums of corresponding ordinates of graphs (8) and (10) 
directly from the figure. Figure 18 illustrates the various forms that the 
curve (9) assumes (with a = 1, and various c). 


If the curve 

y = x^3 

is drawn, we obtain a convenient, though not too accurate, method for 
solving graphically an equation of the third degree: 

x^3 = px + q, 

the roots of this equation being, in fact, the abscissae of the points of 
intersection of the curve y = x^3 with the line 

y = px + q. 

As can be seen from the figure (Fig. 19), there may be one, two, or three 
points of intersection, but there must be at least one, i.e. an equation of 
the third degree must have at least one real root. A rigorous proof of this 
is given later. 
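
The existence of at least one real root can be illustrated numerically by the method of bisection, which is not the graphical method of the text but rests on the same picture: the difference x^3 − px − q is negative far to the left and positive far to the right, and must vanish in between. The function name `real_root` and the bound 1 + |p| + |q| are our own choices for this sketch.

```python
def real_root(p, q, iterations=100):
    """One real root of x^3 = p*x + q, found by bisection."""

    def f(x):
        return x ** 3 - p * x - q

    # For |x| >= 1 + |p| + |q| the cube dominates, so f(lo) < 0 < f(hi).
    hi = 1.0 + abs(p) + abs(q)
    lo = -hi
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if f(mid) > 0:
            hi = mid    # the sign change lies in the left half
        else:
            lo = mid    # the sign change lies in the right half
    return (lo + hi) / 2


x = real_root(1, 6)     # the equation x^3 = x + 6
print(x, x ** 3 - x - 6)
```

When the line meets the curve in three points, the procedure returns only one of the three roots.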


18. The law of inverse proportionality. The functional relationship: 

y = m/x (11) 

expresses the law of inverse proportionality between variables x and y. 
Variable y decreases by as many times as x increases. For m > 0, 
x and y have the same sign, i.e. the graph is located in the first and 
third quadrants of the coordinate axes; similarly, the graph is in the 
second and fourth quadrants, for m < 0. As x approaches zero, the 
absolute value of the fraction m/x becomes very large. Conversely, 
for large absolute values of x, the ratio m/x becomes small in 
absolute value. 

Plotting this curve directly gives us Fig. 20, which shows curves (11) 
for various m, the full-line curves corresponding to m > 0, and the 
broken curves to m < 0, the corresponding value of m being noted 
against each curve. We see that each of the curves drawn, called 
rectangular hyperbolas, has infinite branches, the points on a given 
branch approaching the coordinate axis OX or OY with indefinite 
increase of the abscissa x or the ordinate y, respectively. These lines 
are called asymptotes to the hyperbola. 


Fig. 20    Fig. 21 


The coefficient m in equation (11) is completely defined if any point 
M_0(x_0, y_0) of the curve in question is given, since now: 

x_0 y_0 = m; 

equation (11) can now be written: 

xy = x_0 y_0 (12) 

or 

y / y_0 = x_0 / x. 


Hence follows a graphical method for obtaining any required number of points 
of a rectangular hyperbola, given its asymptotes and any one point of it 
M_0(x_0, y_0). Taking the asymptotes as coordinate axes, we draw an arbitrary 
pencil of lines OP_1, OP_2, ... from the origin, and mark off the points of 
intersection of these lines with the lines y = y_0 and x = x_0. 

We draw new lines parallel to the axes through each pair of points 
of intersection lying on a given line of the pencil; the points of 
intersection of these new lines are then points of the rectangular hyperbola 
(Fig. 21). This follows from the similarity of triangles ORQ_1 and OSP_1: 

SP_1 / OS = OR / RQ_1, or y_1 / x_0 = y_0 / x_1, 

i.e. the point M_1(x_1, y_1) lies on the curve (12). 


19. Power functions. The functions y = ax, y = ax^2, y = ax^3 and 
y = m/x which we have studied above, are particular cases of a 
function of the form: 

y = ax^n, (13) 

where a and n are any given constants. Function (13) is called in 
general a power function. We shall confine ourselves to drawing the 
curves for x positive and a = 1. Figures 22 and 23 show the graphs 
corresponding to various values of n. 

The equation y = x^n gives y = 1 at x = 1 for all values of n, i.e. all 
the curves pass through the point (1, 1). The curves rise for n positive 
and x > 1, the rate of rise increasing with the size of n (Fig. 22). 
The function y = x^n is equivalent to a fraction for negative n (Fig. 23). 
Instead of y = x^{-2}, for instance, we can write y = 1/x^2. In these cases, 
the ordinate y diminishes with increase of x. The curves corresponding 
to equation (13) are sometimes referred to as polytropic. They are 
often encountered in thermodynamics. 

It may be noted here, that we take the value of the radical as 
positive, for fractional n; for example, we take as positive x^{1/2} = √x. 


The two constants a and n appearing in equation (13) are defined, provided 
two points of the curve M_1(x_1, y_1) and M_2(x_2, y_2) are given, in which case 
we have: 

y_1 = a x_1^n, y_2 = a x_2^n; (14) 

we eliminate a by dividing one equation by the other: 

y_1 / y_2 = (x_1 / x_2)^n. 

Taking logarithms, we obtain n as 

n = (log y_1 − log y_2) / (log x_1 − log x_2); 

having found n, we obtain a from either of equations (14). 
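
The determination of a and n from two points can be carried out as follows. This is a minimal sketch, assuming positive coordinates and x_1 different from x_2; the function name `power_law` and the sample points are our own.

```python
import math


def power_law(p1, p2):
    """Constants (a, n) of the power function y = a*x^n through two given points.

    Both points must have positive coordinates and different abscissae.
    """
    (x1, y1), (x2, y2) = p1, p2
    # The exponent follows from the ratio of the logarithms, as in the text.
    n = (math.log(y1) - math.log(y2)) / (math.log(x1) - math.log(x2))
    a = y1 / x1 ** n    # from the first of equations (14)
    return a, n


# Two points taken on the curve y = 3x^2.
a, n = power_law((1.0, 3.0), (2.0, 12.0))
print(a, n)
```

Any base of logarithms may be used, since only the ratio of two differences of logarithms enters the formula.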

Figure 24 illustrates a graphical method of obtaining any required number
of points of the curve (13), given two of its points $M_1(x_1, y_1)$ and $M_2(x_2, y_2)$.
We draw two arbitrary lines through the point O at angles $\alpha$ and $\beta$ to axes
OX and OY respectively; we then take perpendiculars to the axes from the
given points $M_1$, $M_2$, intersecting the arbitrary lines in points $S_1$, $S_2$; $T_1$, $T_2$
and intersecting the axes in points $Q_1$, $Q_2$; $R_1$, $R_2$. We now draw $R_2T_3$ through
$R_2$ parallel to $R_1T_2$, and $S_2Q_3$ through $S_2$ parallel to $S_1Q_2$. Having finally
drawn lines parallel to the corresponding axes through $T_3$ and $Q_3$, the
intersection of these gives us the point $M_3(x_3, y_3)$ of the curve. Taking similar
triangles, we have:

$$\frac{OQ_3}{OQ_2} = \frac{OS_2}{OS_1} = \frac{OQ_2}{OQ_1},$$

i.e.

$$\frac{x_3}{x_2} = \frac{x_2}{x_1},$$

whence:

$$x_3 = \frac{x_2^2}{x_1}.$$

It can be shown in exactly the same way that

$$y_3 = \frac{y_2^2}{y_1}.$$

Using (14), we find:

$$y_3 = \frac{a^2 x_2^{2n}}{a x_1^n} = a\left(\frac{x_2^2}{x_1}\right)^n = a x_3^n,$$

i.e. the point $(x_3, y_3)$ lies on the curve (13), as was required to be shown.
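
The result of the construction, namely that two points of the curve yield a third whose coordinates continue the same geometric progressions, can be checked numerically. The sketch below is an illustration only; the names `curve` and `next_point` and the sample constants a = 2, n = 1.5 are assumptions.

```python
def curve(x, a=2.0, n=1.5):
    """Sample power function y = a * x**n (a and n are assumed values)."""
    return a * x ** n

def next_point(p1, p2):
    """Third point from two given ones: x3 = x2**2 / x1, y3 = y2**2 / y1."""
    (x1, y1), (x2, y2) = p1, p2
    return (x2 ** 2 / x1, y2 ** 2 / y1)

# Start from two points of the curve and repeat the construction four times
points = [(1.0, curve(1.0)), (1.5, curve(1.5))]
for _ in range(4):
    points.append(next_point(points[-2], points[-1]))
```

Each new pair in `points` should satisfy y = a x^n to within rounding error, so repeating the step produces as many points of the curve as required.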


20. Inverse functions. We shall introduce a new concept, that of 
inverse function, in order to study further elementary functions. 
As was mentioned in [5], we are free to choose either x or y as inde- 
pendent variable in a functional relationship between them, the actual 
decision being made purely as a matter of convenience. Let us take 
a certain function y = f(x), with x as independent variable. 

The function defined by the same functional relationship y = f(x),
but having y as independent variable and x as the function

$$x = \varphi(y),$$

is called the inverse of the given function f(x), this latter being often called
the direct function.

The notation for the variables is not important in itself, so that,
denoting the independent variable in both cases by the letter x, we
can say that $\varphi(x)$ is the inverse of f(x). For example, if the direct
functions are

$$y = ax + b, \qquad y = x^n,$$

the inverse functions are

$$y = \frac{x - b}{a}, \qquad y = \sqrt[n]{x}.$$

Finding the inverse function from the equation of the direct function
is called inversion.
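
Inversion of the two direct functions just mentioned may be illustrated by a short sketch. The function names and the sample constants a = 2, b = 3, n = 3 are assumptions made for the example; the root is taken positive, in accordance with the convention of [19].

```python
def direct(x, a=2.0, b=3.0):
    """Direct function y = a*x + b (a and b are sample values)."""
    return a * x + b

def inverse(y, a=2.0, b=3.0):
    """Inverse function, obtained by solving y = a*x + b for x."""
    return (y - b) / a

def power(x, n=3):
    """Direct function y = x**n."""
    return x ** n

def root(y, n=3):
    """Inverse of power() for y >= 0: the positive n-th root."""
    return y ** (1.0 / n)

# Applying the inverse after the direct function returns the original x
round_trips = [inverse(direct(x)) for x in (-1.0, 0.0, 2.5)]
```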

Let us take the graph of the direct function y = f(x). It can easily
be seen that the same graph can serve as a graph of the inverse
function $x = \varphi(y)$.

In fact, both the equations y = f(x) and $x = \varphi(y)$ give the same
functional relationship between x and y. Suppose an arbitrary x is given
in the direct function. If we mark off an interval from the origin O
along the axis OX, corresponding to the number x, then erect a
perpendicular to OX from the end of this interval as far as its
intersection with the graph, we obtain the value of y corresponding
to the chosen x as the length of this perpendicular, with the
corresponding sign. In the case of the inverse function $x = \varphi(y)$, we have
only to measure off the given value y from the origin O along the axis
OY, a perpendicular to OY then being erected from the end of this
segment as far as its intersection with the graph. The length of this
perpendicular, with the relevant sign, gives us the value of x
corresponding to the chosen y.

The inconvenience arises here that the independent variable x is
measured off on one axis, namely OX, in the first case, whilst
in the second case the independent variable y is measured off
on the other axis, OY. In other words, we can only keep the
same graph on transition from the direct function y = f(x)
to the inverse function $x = \varphi(y)$, provided we bear in mind that, on
making the transition, the axis representing the value of the independent
variable becomes the axis representing the function, and vice versa.

To avoid this inconvenience, we turn the plane as a whole on making
the transition, so that the axes OX and OY change places.
We simply turn the plane of the figure, together with the graph,
through 180° about the bisector of the first quadrant of the coordinate
axes. The axes having changed places, the inverse function $x = \varphi(y)$
now has to be written in the usual way: $y = \varphi(x)$. Thus, given the graph
of the direct function y = f(x), the graph of the inverse function $y = \varphi(x)$
is obtained simply by turning the plane of the graph through 180° about
the bisector of the first quadrant of the coordinate axes.
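
In terms of coordinates, turning the plane through 180° about the bisector of the first quadrant sends the point (x, y) to the point (y, x). A minimal sketch of this, in which the function name and the sample function y = x³ are assumptions made for illustration:

```python
def reflect_about_bisector(points):
    """Swap coordinates: (x, y) -> (y, x), the mirror image in the line y = x."""
    return [(y, x) for (x, y) in points]

# Sample direct function y = x**3, tabulated at a few abscissae
direct_graph = [(x, x ** 3) for x in (-2, -1, 0, 1, 2)]

# The same points, read as the graph of the inverse function
inverse_graph = reflect_about_bisector(direct_graph)
```

Applying the same operation twice restores the original graph, corresponding to the fact that the inverse of the inverse is the direct function.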

The full-line curve of Fig. 25 represents a direct function, the
broken curve being its inverse. A broken line is also used for the
bisector of the first quadrant of the coordinate axes, the plane of
the figure being rotated about this, in order to obtain the broken
curve from the full curve.


21. Many-valued functions. It is characteristic of all the graphs of
elementary functions considered above, that perpendiculars to the
axis OX cut the graph in not more than one, and for the most part
in one, point. This means that, given x in the function defined by the
graph, one corresponding value of y is defined. A function of this
sort is said to be single-valued.

If perpendiculars to OX cut the graph in more than one point,
this means that, given x, there are several corresponding ordinates
of the graph, i.e. several values of y. Such functions are called
many-valued; they have already been mentioned in [5].

Although the direct function y = f(x) is single-valued, its inverse
$y = \varphi(x)$ can be many-valued. This is evident, for instance, from Fig. 25.

We shall consider in detail one elementary case. The continuous
curve of Fig. 13 is the graph of the function $y = x^2$. If the figure
is turned through 180° about the bisector of the first quadrant of the
axes, the graph of the inverse function $y = \sqrt{x}$ is obtained (Fig. 26).

Let us consider it in detail. For negative x (to the left of axis OY),
perpendiculars to OX do not intersect the graph, i.e. the function
$y = \sqrt{x}$ is not defined for x < 0. This corresponds to the fact that
the square root of a negative number has no real value. For positive x,
however, perpendiculars to OX cut the graph in two points, i.e.
for a given positive x we have two ordinates of the graph: $MN$ and
$MN_1$. The first ordinate gives a certain positive y, the second a
negative value of the same absolute magnitude. This corresponds to
the fact that the square root of a positive number has two values,
equal in absolute magnitude and opposite in sign. It is also evident
from the figure that we have only one value y = 0 for x = 0. Thus,
the function $y = \sqrt{x}$, defined for $x \ge 0$, has two values for x > 0,
and one value for x = 0.

It may be noted that our function $y = \sqrt{x}$ can be made single-valued
by taking only part of the graph of Fig. 26. For instance,
let us take only the part in the first quadrant of the axes (Fig. 27).
This corresponds to taking only positive values of the square root.

It may also be noted that the part of the graph of the function $y = \sqrt{x}$,
shown in Fig. 27, is obtained from the part of the graph of the direct
function $y = x^2$ (Fig. 13) lying on the right of axis OY. The part of
the graph of

$$y = \sqrt{x} \quad\text{or}\quad y = x^{1/2}$$

lying in the first quadrant of the axes, has already been illustrated in
Fig. 22.

We now turn to the case when rotation of a single-valued direct
function leads to a single-valued inverse function. A new concept
must be introduced for this.

[Fig. 27]

The function y = f(x) is said to be increasing, if y increases when the
independent variable x increases, i.e. if $x_2 > x_1$ implies $f(x_2) > f(x_1)$.

With the axes OX, OY as used by us, increasing x implies movement 
along OX to the right, and increasing y, movement upwards along 
OY. It is characteristic of the graph of an increasing function that, 
on moving along the curve in the direction of increasing x (to the 
right), we also move in the direction of increasing y (upwards). 

Let us consider the graph of a single-valued increasing function,
defined in the interval $a \le x \le b$ (Fig. 28). Let f(a) = c and f(b) = d,
so that evidently, since the function is increasing, c < d. If we take
any y in the interval $c \le y \le d$, and draw the corresponding perpendicular
to the axis OY, this perpendicular will cut our graph in only one
point, i.e. for every y in the interval $c \le y \le d$ there is one
corresponding definite value of x. In other words, the inverse of an increasing
function is single-valued.

It is clear from the figure that this inverse function is increasing.
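
Since each y between c and d corresponds to exactly one x, the inverse of a continuous increasing function can be computed by repeatedly halving the interval in which x must lie. The following sketch illustrates this; the method of bisection is not taken from the text, and the function names, the tolerance and the sample function x³ + x are assumptions.

```python
def invert_increasing(f, a, b, y, tol=1e-12):
    """Solve f(x) = y on [a, b] for a continuous increasing f by bisection."""
    if not f(a) <= y <= f(b):
        raise ValueError("y lies outside the interval [f(a), f(b)]")
    lo, hi = a, b
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if f(mid) < y:
            lo = mid   # the required x lies to the right of mid
        else:
            hi = mid   # the required x lies at mid or to its left
    return (lo + hi) / 2.0

def cube_plus_x(x):
    """A sample increasing function."""
    return x ** 3 + x

# f(2) = 10, so the value found should be close to 2
x = invert_increasing(cube_plus_x, 0.0, 2.0, 10.0)
```

The procedure relies on the function being increasing: the comparison of f(mid) with y decides unambiguously in which half of the interval the required x lies.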

Similarly, the function y = f(x) is said to be decreasing, if increase
of the independent variable x implies decrease of the corresponding y,
i.e. if $x_2 > x_1$ implies $f(x_2) < f(x_1)$. It can be shown, as above, that
the inverse of a decreasing function is a single-valued, decreasing 
function. A further important fact will be noted. We assume in all 
our discussions that the graph of the function consists of an unbroken, 
continuous curve. This is equivalent to a special analytic property 
of the function f(x), viz, continuity of the function. A rigorous mathe- 
matical definition of continuity is given in Chapter II, where con- 
tinuous functions are studied. The present chapter aims merely at 
a preliminary acquaintance with basic concepts, which are considered 
systematically in later chapters. 

It may be noted as regards terminology, that when no reference 
is made to a function being many-valued, it can always be assumed 
to be single-valued. 


22. Exponential and logarithmic functions. Let us now return to the
study of elementary functions. An exponential function is defined by
the equation

$$y = a^x, \tag{15}$$

where the base a is a given positive number, different from unity. The
value of $a^x$ is evident for integral positive x. For fractional positive x,
$a^x$ is defined as the radical $a^{p/q} = \sqrt[q]{a^p}$, where, with even q, we agree
to take the positive value of the radical. Without entering at present
into a detailed consideration of the value of $a^x$ for irrational x, it
will simply be said that approximate values of $a^x$ are obtained by
taking approximate values for the irrational x (as described above
in [2]); and the closer the approximation for x, the closer for $a^x$.
For instance, as approximations for $a^{\sqrt{2}}$, where

$$\sqrt{2} = 1.414213\ldots,$$

we have:

$$a^1 = a; \quad a^{1.4} = \sqrt[10]{a^{14}}; \quad a^{1.41} = \sqrt[100]{a^{141}}; \quad \ldots$$
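
The successive approximations can be followed numerically. In the sketch below the base a = 3 and all names are assumptions made for illustration; the power $a^{p/q}$ is computed as the p-th power of the positive q-th root, which equals $\sqrt[q]{a^p}$ but avoids forming the very large number $a^p$.

```python
from fractions import Fraction
import math

def rational_power(a, p, q):
    """a**(p/q) for a > 0, as the p-th power of the positive q-th root of a."""
    return (a ** (1.0 / q)) ** p

a = 3.0  # sample base, chosen only for illustration

# Decimal approximations 1, 1.4, 1.41, 1.414, 1.4142 of sqrt(2) as fractions
exponents = [Fraction(1), Fraction(14, 10), Fraction(141, 100),
             Fraction(1414, 1000), Fraction(14142, 10000)]

approximations = [rational_power(a, r.numerator, r.denominator)
                  for r in exponents]

# Comparison value, using the library's own evaluation of a**sqrt(2)
target = a ** math.sqrt(2)
errors = [abs(v - target) for v in approximations]
```

The list `errors` should decrease from one entry to the next, in accordance with the remark that a closer approximation for x gives a closer approximation for $a^x$.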


Evaluation of $a^x$ for negative x follows from evaluation for positive x,
since $a^{-x} = 1/a^x$. Since we agreed above always to take the positive
radical in the expression $a^{p/q} = \sqrt[q]{a^p}$, it follows that the function $a^x$
is positive for all real x. Apart from this, it may be mentioned in
passing that it can be shown that $a^x$ is an increasing function for
a > 1, and a decreasing function for 0 < a < 1. A more detailed
study of this function is given later, in [44].

Figure 29 illustrates the graph of function (15) for various values
of a. We shall note some peculiarities of these graphs. We have,
first of all, $a^0 = 1$ for any a, so that the graph of function (15) passes
through the point y = 1 on the axis OY for any a, i.e. through the
point with coordinates x = 0, y = 1. For a > 1, the curve rises indefinitely
from left to right (in the direction of increasing x). On moving to the left,
it approaches the axis OX indefinitely, without ever touching it. The curve
is differently situated relative to the axes for a < 1. On moving to the
right, the curve indefinitely approaches the axis OX, whilst it rises
indefinitely on moving to the left. Since $a^x$ is always positive, the graph
is evidently located above the axis OX. It may be noted further, that the
graph of the function $y = (1/a)^x$ can be obtained from the graph of $y = a^x$,
by turning the figure through 180° about the axis OY. This follows directly
from the fact that, with such turning, x becomes −x, and $a^{-x} = (1/a)^x$.

A further remark: if a = 1, then $y = 1^x$, so that we have y = 1
for all x [12].

A logarithmic function is defined by the equation:

$$y = \log_a x. \tag{16}$$


By definition of logarithm, function (16) is the inverse of function
(15). Thus, we can obtain the graph of the logarithmic function
(Fig. 30) from the graph of the exponential, by turning the curves
of Fig. 29 through 180° about the bisector of the first quadrant of
the axes. Since function (15) is increasing for a > 1, the inverse
function (16) is also increasing and single-valued, being defined only
for x > 0, as is evident from Fig. 29 (logarithms of negative numbers
do not exist). All the graphs of Fig. 30 cut the axis OX in the point
x = 1. This corresponds to the fact that the logarithm of unity is
zero for any base. For the sake of clarity, Fig. 31 shows a single graph
of (16) for a > 1.

The concept of logarithmic scale and the theory of the logarithmic slide-rule
are closely associated with the concept of logarithmic function.

A scale is called logarithmic when it is drawn on a given line so that the
length of a division, instead of corresponding with the number which denotes
the division, corresponds with the logarithm of this number, usually to base
10 (Fig. 32). Thus, if a certain division of the scale denotes the number x,
the length of the segment $\overline{1x}$ is equal to $\log_{10} x$, instead of x. The length of
the segment between two points of the scale, denoting x and y, will be:

$$\overline{1y} - \overline{1x} = \log_{10} y - \log_{10} x = \log_{10} \frac{y}{x} \quad \text{(Fig. 32)};$$

the logarithm of the product xy is obtained simply by adding segment $\overline{1y}$
to $\overline{1x}$, since the segment thus obtained is equal to:

$$\log_{10} x + \log_{10} y = \log_{10} (xy).$$

Thus, given a logarithmic scale, multiplication and division of numbers
can be carried out simply by adding and subtracting segments of the scale,
this being realized most simply in practice with the aid of two identical scales,
one of which can slide along the other (Figs. 32 and 33). This is the idea
underlying the construction of logarithmic slide-rules.

Logarithmic paper is often used for calculations; this consists of ruled sheets,
where the divisions of the axes OX and OY are in accordance with a logarithmic,
instead of an ordinary, scale.
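
The principle of the slide-rule, multiplication by addition of segments, can be imitated in a few lines. This is a sketch only; the function names and the sample numbers are assumptions, and the result is exact only up to rounding, just as a reading taken from a real scale is approximate.

```python
import math

def scale_position(x):
    """Distance of the division marked x from the division marked 1."""
    return math.log10(x)

def slide_rule_multiply(x, y):
    """Multiply by adding the two segments and reading the scale back."""
    total_length = scale_position(x) + scale_position(y)
    return 10.0 ** total_length

def slide_rule_divide(y, x):
    """Divide by subtracting one segment from the other."""
    return 10.0 ** (scale_position(y) - scale_position(x))

product = slide_rule_multiply(2.0, 3.5)   # close to 7, up to rounding
quotient = slide_rule_divide(9.0, 4.0)    # close to 2.25, up to rounding
```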


23. Trigonometric functions. We shall only consider the four basic
trigonometric functions

$$y = \sin x, \quad y = \cos x, \quad y = \tan x, \quad y = \cot x,$$

the independent variable x being measured in radians, i.e. taking
as unit angle the angle subtended at the centre of a circle by an arc
equal in length to the radius.

[Fig. 34]

[Fig. 35]

The graph of the function y = sin x is shown in Fig. 34. It is evident
from the formula:

$$\cos x = \sin\left(x + \frac{\pi}{2}\right)$$

that the graph of y = cos x (Fig. 35) can be obtained from the graph
of y = sin x simply by displacing it to the left along the axis OX
by an amount $\pi/2$.

The graph of the function y = tan x is shown in Fig. 36. The curve
consists of a series of identical, separate, infinite branches. Each
branch is located in a vertical strip of width $\pi$ and is the graph of an
increasing function of x. Finally, Fig. 37 gives the graph of y = cot x,
which is also made up of separate infinite branches.

The graphs obtained by displacing the graphs of y = sin x and
y = cos x to the left or right along the axis OX by an amount $2\pi$,
coincide with the original graphs, corresponding to the fact that
functions sin x and cos x have period $2\pi$, i.e.

$$\sin(x + 2\pi) = \sin x \quad\text{and}\quad \cos(x + 2\pi) = \cos x,$$

for any x. The graphs of functions y = tan x and y = cot x similarly
coincide on displacement along the axis OX by the amount $\pi$.

The graphs of the functions:

$$y = A \sin ax, \qquad y = A \cos ax \qquad (A > 0,\ a > 0) \tag{17}$$

are always similar to those of y = sin x and y = cos x. For example,
to obtain the graph of the first of functions (17) from the graph of
y = sin x, the lengths of all the ordinates of this latter function must
be multiplied by A, and the scale on the axis OX changed in such
a way that the point with abscissa x becomes the point with abscissa
x/a. Functions (17) are also periodic, with period $2\pi/a$.

The graphs of the more complicated functions:

$$y = A \sin(ax + b), \qquad y = A \cos(ax + b), \tag{18}$$

which are referred to as simple harmonic curves, are obtained from the
graphs of functions (17) by displacement to the left along the axis OX by
an amount b/a (taking b > 0). Functions (18) also have a period of $2\pi/a$.

The graph of another more complicated function:

$$y = A_1 \sin a_1 x + B_1 \cos a_1 x + A_2 \sin a_2 x + B_2 \cos a_2 x,$$

consists of the sum of several terms of type (17), and can be constructed,
for example, by adding the ordinates of the graphs of the separate
terms. The curves thus obtained are usually referred to as compound
harmonic curves. Figure 38 illustrates the construction of the graph
of the function:

$$y = 2 \sin x + \cos 2x.$$

It may be remarked here that the function:

$$y = A_1 \sin a_1 x + B_1 \cos a_1 x \tag{19}$$

can be written in the form (18) and represents a simple harmonic
oscillation.

We write, in fact:

$$m = \frac{A_1}{\sqrt{A_1^2 + B_1^2}}, \qquad n = \frac{B_1}{\sqrt{A_1^2 + B_1^2}}, \qquad A = \sqrt{A_1^2 + B_1^2}.$$

Obviously:

$$A_1 = mA, \qquad B_1 = nA, \tag{20}$$

and also:

$$m^2 + n^2 = 1, \qquad |m| \le 1, \quad |n| \le 1,$$

so that, from trigonometry, an angle $b_1$ can always be found such
that:

$$\cos b_1 = m, \qquad \sin b_1 = n. \tag{21}$$

Substituting for $A_1$ and $B_1$ in (19) from expressions (20), and using
(21), we obtain:

$$y = A(\cos b_1 \cdot \sin a_1 x + \sin b_1 \cdot \cos a_1 x),$$

i.e.

$$y = A \sin(a_1 x + b_1).$$
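
The reduction of (19) to a single sine can be verified numerically. In the following sketch the coefficients $A_1 = 3$, $B_1 = 4$, the frequency $a_1 = 2$ and all names are assumptions made for illustration; the angle $b_1$ satisfying (21) is obtained with the library function `math.atan2`.

```python
import math

def to_amplitude_phase(a1_coeff, b1_coeff):
    """Return (A, b1) such that A1*sin(t) + B1*cos(t) == A*sin(t + b1)."""
    amplitude = math.hypot(a1_coeff, b1_coeff)   # A = sqrt(A1^2 + B1^2)
    # atan2 gives the angle b1 with cos b1 = A1/A and sin b1 = B1/A
    phase = math.atan2(b1_coeff, a1_coeff)
    return amplitude, phase

A1, B1, freq = 3.0, 4.0, 2.0       # sample coefficients and frequency a1
A, b1 = to_amplitude_phase(A1, B1)

def two_term_form(x):
    """The function in the form (19)."""
    return A1 * math.sin(freq * x) + B1 * math.cos(freq * x)

def single_sine_form(x):
    """The same function in the form (18)."""
    return A * math.sin(freq * x + b1)

# Largest discrepancy between the two forms at sample abscissae in [-5, 5]
max_gap = max(abs(two_term_form(k / 10) - single_sine_form(k / 10))
              for k in range(-50, 51))
```

With these sample coefficients the amplitude is A = 5, and `max_gap` should be of the order of rounding error only.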


24. Inverse trigonometric, or circular, functions. These functions are
obtained by inversion of the trigonometric functions:

$$y = \sin x, \quad \cos x, \quad \tan x, \quad \cot x,$$

their symbols being respectively:

$$y = \operatorname{arc\,sin} x, \quad \operatorname{arc\,cos} x, \quad \operatorname{arc\,tan} x, \quad \operatorname{arc\,cot} x;$$

these symbols are simply abbreviated forms of description for the
angle (or arc), of which the sine, cosine, tangent or cotangent is
respectively equal to x.

We shall consider the function:

$$y = \operatorname{arc\,sin} x. \tag{22}$$

The graph of this function (Fig. 39) is obtained from the graph
of y = sin x by the rule given in [20]. This graph is wholly located
in the vertical strip of width two, based on the interval $-1 \le x \le +1$
of the axis OX, i.e. the function (22) is only defined in the interval
$-1 \le x \le +1$. Furthermore, equation (22) is equivalent to the
equation sin y = x; and, as is known from trigonometry, for a given
x we obtain an infinite number of values of y. We see from the graph,
in fact, that perpendiculars to the axis OX from points in the interval
$-1 \le x \le +1$ have an infinite number of points in common with
the graph, i.e. function (22) is many-valued.

We see directly from Fig. 39 that function (22) becomes single-valued
if, instead of taking all the graph, we limit ourselves to the
part shown in heavier type, which corresponds to stipulating that we
shall consider only those values of the angle y, having a given sin y = x,
which lie in the interval $(-\pi/2, \pi/2)$.

Figures 40 and 41 illustrate the graphs of y = arc cos x and
y = arc tan x, the parts of the graphs in heavier type being those which
must be kept in order to make the functions single-valued (we leave
it to the reader to draw the figure for arc cot x). It may be noted here
that the functions y = arc tan x and y = arc cot x are defined for
all real values of x.

By noting from the figure the interval of variation of y over the
heavier part of the curve, we obtain a table of bounds, within which
the function remains single-valued:

y | arc sin x | arc cos x | arc tan x | arc cot x
Bounds for y | $-\pi/2 \le y \le \pi/2$ | $0 \le y \le \pi$ | $-\pi/2 < y < \pi/2$ | $0 < y < \pi$

It can easily be shown that the functions thus defined, called the
principal values of the inverse trigonometric functions, satisfy the
relationships:

$$\operatorname{arc\,sin} x + \operatorname{arc\,cos} x = \frac{\pi}{2}, \qquad \operatorname{arc\,tan} x + \operatorname{arc\,cot} x = \frac{\pi}{2}. \tag{23}$$
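
Relationships (23) can be checked numerically for the principal values. The standard Python library provides `math.asin`, `math.acos` and `math.atan`, but no inverse cotangent; the helper `arc_cot` below is therefore an assumption of this sketch, defined through `math.atan2` so that its values lie between 0 and π.

```python
import math

def arc_cot(x):
    """Principal value of arc cot x, lying strictly between 0 and pi."""
    return math.atan2(1.0, x)

samples = [-1.0, -0.5, 0.0, 0.3, 1.0]

# Both lists should consist of values equal to pi/2, up to rounding
sine_cosine_sums = [math.asin(x) + math.acos(x) for x in samples]
tangent_cotangent_sums = [math.atan(x) + arc_cot(x) for x in samples]
```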


§ 2. The theory of limits. 
Continuous functions 


25. Ordered variables. When referring to the independent variable x,
we have only been concerned with the set of the values that x
can assume. For example, this can be the set of values satisfying
$0 \le x \le 1$. We shall now consider the variable x taking an infinity
of values in sequence, i.e. we are now interested, not only in the set
of values of x, but also in the order in which it takes these values.
More precisely, we assume the possibility of distinguishing, for every
value of x, a value that precedes it and a value that follows it,
it being also assumed that no value of the variable is the last, i.e.
whatever value we take, there exists an infinity of successive values.
A variable of this type is sometimes called ordered. If $x'$, $x''$ are two
values of the ordered variable x, a preceding and a succeeding value
can be distinguished, whilst if $x'$ precedes $x''$, and $x''$ precedes $x'''$,
then $x'$ precedes $x'''$. We shall assume, for example, that the set of
values of x is defined by $0 \le x < 1$, and that of two distinct values
$x'$ and $x''$ the succeeding value is the greater. We thus obtain an ordered
variable, continuously increasing through all real values from zero to
unity, without reaching unity. The sequence of values of the variable,
for phenomena occurring in time, is established by the temporal
sequence, and we shall sometimes make use of this time-scheme below,
using terms such as “previous” and “later” in place of “preceding”
and “succeeding” values.

An important particular case of an ordered variable is that when
the sequence of values of the variable can be enumerated, by arranging
them in a series of the form:

$$x_1, x_2, x_3, \ldots, x_n, \ldots$$

so that, given two values $x_p$ and $x_q$, the succeeding value is the one
that has the greater subscript. In the case mentioned above, when the variable
increases from zero to unity, we can clearly not enumerate its
successive values. It may also be noted that it is possible to encounter
identical values amongst those of an ordered variable. For example,
we might have $x_p = 7$ and $x_q = 7$ for two different subscripts p and q
in the enumerated variable. Abstracting, as we always do, from the concrete
nature of the magnitude (length, weight etc.), we must understand by the
term “ordered variable”, or as we shall say for brevity, “variable”, simply
the total sequence of its numerical values. We normally introduce one letter,
say x, and suppose that it assumes successively the above-mentioned
numerical values.

For every value of the variable x, a corresponding point K is defined
on the axis OX. Thus, as x varies in sequence, the point K moves
along OX.

The present paragraph is devoted to the basic theory of limits, 
which is fundamental to all modern mathematical analysis. This 
theory considers some extremely simple, and at the same time, ex- 
tremely important, cases of variation of magnitudes. 


26. Infinitesimals. We assume that the point K constantly remains
inside a certain interval of the axis OX. This is equivalent to the
condition that the length of the interval OK, where O is the origin,
remains less than a definite positive number M. The magnitude x is
said to be bounded in this case. Noting that the length of OK is $|x|$,
we can give the following definition:

DEFINITION. A variable x is said to be bounded, if there exists a
positive number M, such that $|x| < M$ for all values of x.

We can take $x = \sin\alpha$ as an example of a bounded magnitude,
where the angle $\alpha$ varies in any manner. Here, M can be taken as any
number greater than unity.

We now consider the case when the point K is displaced successively,
and indefinitely approaches the origin. More precisely, we suppose
that successive displacements of point K bring it inside any previously
assigned small section S’S of the axis OX with centre O, and that it
remains inside this section on further displacement. In this case, we
say that the magnitude x tends to zero or is an infinitesimal.

We denote the length of the interval S’S by $2\varepsilon$, where $\varepsilon$ signifies
any given positive number. If the point K is inside S’S, then
$OK < \varepsilon$ and conversely, if $OK < \varepsilon$, K is inside S’S. We can thus
give the following definition: The variable x tends to zero or is an
infinitesimal, if for any given positive $\varepsilon$ there exists a value of x, such
that for all subsequent values of x, $|x| < \varepsilon$.

In view of the importance of the concept of infinitesimal, we give
another formulation of the same definition.

DEFINITION. A magnitude x is said to tend to zero or to be an
infinitesimal, if on successive variation $|x|$ becomes, and on further variation
remains, less than any previously assigned small positive number $\varepsilon$.

The term “infinitesimal” denotes the character of the variation 
of the variable described above, and the underlying concept is not 
to be confused with that of a very small magnitude, which is often 
employed in practice. 

Suppose that, in measuring a certain tract of land, we obtained 
1000 m, with some remainder that we considered very small in com- 
parison with the total length, so that we neglected it. The length of 
this remainder is expressed by a definite positive number, and the 
term “infinitesimal” is evidently not applicable here. If we were 
to meet with the same remainder in a second, more accurate measure- 
ment, we should cease to consider it as very small, and we should 
take it into account. It is thus clear that the concept of a small mag- 
nitude is a relative concept, bound up with the practical nature of 
the measurement. 

Suppose that the successive values of the variable x are

$$x_1, x_2, x_3, \ldots, x_n, \ldots$$

and let $\varepsilon$ be any given positive number. To prove that x is an
infinitesimal, we must show that, starting with a certain value of n, $|x_n|$
will be less than $\varepsilon$, i.e. we must be able to find a certain integer N
such that

$$|x_n| < \varepsilon \quad \text{for } n > N.$$

This N depends on $\varepsilon$.

As an example of an infinitesimal, we take the magnitude assuming
successively the values:

$$q, q^2, q^3, \ldots, q^n, \ldots \qquad (0 < q < 1). \tag{1}$$

We have to satisfy the inequality:

$$q^n < \varepsilon \quad\text{or}\quad n \log_{10} q < \log_{10} \varepsilon.$$

Remembering that $\log_{10} q$ is negative, we can rewrite the above
inequality as:

$$n > \frac{\log_{10} \varepsilon}{\log_{10} q},$$

since division by a negative number changes the sense of the
inequality; thus we can now take N as the largest integer in the quotient
$\log_{10} \varepsilon / \log_{10} q$. Thus the magnitude in question, or as we usually say,
the sequence (1) tends to zero.
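
The choice of N for the sequence (1) can be illustrated numerically. In this sketch the function name `index_bound` and the sample values q = 0.5, ε = 0.001 are assumptions; the result is not allowed to fall below zero, which only matters when ε is greater than unity.

```python
import math

def index_bound(q, eps):
    """Largest integer in log10(eps)/log10(q), not below 0, for 0 < q < 1."""
    if not (0.0 < q < 1.0 and eps > 0.0):
        raise ValueError("need 0 < q < 1 and eps > 0")
    return max(0, math.floor(math.log10(eps) / math.log10(q)))

q, eps = 0.5, 1e-3           # sample values
N = index_bound(q, eps)      # the quotient is about 9.97 for these values

# Every term with subscript greater than N should be less than eps
tail_is_small = all(q ** n < eps for n in range(N + 1, N + 200))
```

For the sample values the term with subscript N is still not less than ε, so that no smaller N would serve here.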

If we replace q by (−q) in the sequence (1), the only difference is
the appearance of the minus sign with odd powers; the absolute
magnitude of the members of the sequence is as before, and hence
we also have an infinitesimal in this case.

The fact that x is infinitesimal is usually denoted by:

$$\lim x = 0 \quad\text{or}\quad x \to 0.$$

Here, lim is an abbreviation of “limit”.

We note two properties of infinitesimals.

1. The sum of any (definite) number of infinitesimals is also an
infinitesimal.

Take, for example, the sum w = x + y + z of three infinitesimals,
and suppose that the variables are enumerated. Let

$$x_1, x_2, \ldots; \quad y_1, y_2, \ldots; \quad z_1, z_2, \ldots$$

be the successive values of x, y, z, respectively. We obtain successive
values for w:

$$w_1 = x_1 + y_1 + z_1, \quad w_2 = x_2 + y_2 + z_2, \quad \ldots$$

Let $\varepsilon$ be any given positive number. Since x, y, z are infinitesimals,
we can say that there exists $N_1$, such that $|x_n| < \varepsilon/3$ for $n > N_1$;
$N_2$, such that $|y_n| < \varepsilon/3$ for $n > N_2$; and $N_3$, such that $|z_n| < \varepsilon/3$
for $n > N_3$. If N denotes the greatest of $N_1$, $N_2$ and $N_3$, we have:

$$|x_n| < \frac{\varepsilon}{3}, \quad |y_n| < \frac{\varepsilon}{3}, \quad |z_n| < \frac{\varepsilon}{3} \quad \text{for } n > N,$$

and hence:

$$|w_n| \le |x_n| + |y_n| + |z_n| < \frac{\varepsilon}{3} + \frac{\varepsilon}{3} + \frac{\varepsilon}{3} \quad \text{for } n > N,$$

i.e. $|w_n| < \varepsilon$ for n > N, whence w = x + y + z is an
infinitesimal. In the general case of non-enumerated variables we can
look on x, y, z as functions of some ordered variable t: x = x(t),
y = y(t), z = z(t). Variables x, y, z are themselves ordered, so that
if $t = t'$ precedes $t = t''$, then $x(t')$ precedes $x(t'')$, etc. The sum

$$w(t) = x(t) + y(t) + z(t),$$

obtained by adding the x, y, and z corresponding to the same value
of t, is also ordered. The proof is as above, for enumerated
variables. In this latter case, t has the role of subscript; or the subscript
can be looked on as an increasing, integral t.

2. The product of a bounded magnitude and an infinitesimal is an
infinitesimal.

We consider the product xy of the enumerated variables, where x
is bounded, and y is an infinitesimal. We have the condition that
$|x_n|$ remains less than some positive M for any n. If $\varepsilon$ is any given
positive number, there exists N, such that $|y_n| < \varepsilon/M$ for n > N.
Thus

$$|x_n y_n| = |x_n|\,|y_n| < M \cdot \frac{\varepsilon}{M} = \varepsilon \quad \text{for } n > N.$$

Hence, $|x_n y_n| < \varepsilon$ for n > N, so that $xy \to 0$. The proof is analogous
for non-enumerated variables.

We note that the second property is all the more readily justified
if x is a constant. We can now take M as any positive number greater
than $|x|$, i.e. the product of a constant and an infinitesimal is an
infinitesimal.
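
The second property may be illustrated with the bounded variable $x_n = \sin n$ and the infinitesimal $y_n = 1/n$. The choice of these two sequences, the bound M = 1.5 and the value of ε are assumptions of the sketch; since $|y_n| < \varepsilon/M$ as soon as $n > M/\varepsilon$, any integer N not less than $M/\varepsilon$ is sufficient.

```python
import math

def x_term(n):
    """Bounded variable: |sin n| < M for the sample bound M = 1.5."""
    return math.sin(n)

def y_term(n):
    """Infinitesimal: 1/n is less than any delta > 0 once n > 1/delta."""
    return 1.0 / n

M, eps = 1.5, 1e-2

# |y_n| < eps/M holds for n > M/eps, so this N is sufficient
N = math.ceil(M / eps)

products_small = all(abs(x_term(n) * y_term(n)) < eps
                     for n in range(N + 1, N + 1000))
```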

In view of the fundamental importance of the concept of infini- 
tesimal for what follows, we shall pause to add some remarks supple- 
mentary to the above. 

As we have shown, a variable having the sequence of values (1)
tends to zero if 0 < q < 1 or −1 < q < 0. Setting q = 1/2,
for example, we obtain the sequence:

$$\frac{1}{2}, \frac{1}{2^2}, \frac{1}{2^3}, \ldots, \frac{1}{2^n}, \ldots$$

Each successive value is less than the previous one in this case,
and the variable tends to zero, diminishing all the time. Setting
q = −1/2, we obtain the sequence:

$$-\frac{1}{2}, \frac{1}{2^2}, -\frac{1}{2^3}, \ldots, (-1)^n \frac{1}{2^n}, \ldots$$

Here the variable tends to zero, taking values in turn greater than,
and less than, zero.

Suppose that we insert zero in every other place in the above
sequence, i.e. we take a variable with the sequence:

$$\frac{1}{2}, 0, \frac{1}{4}, 0, \frac{1}{8}, 0, \frac{1}{16}, 0, \frac{1}{32}, 0, \frac{1}{64}, 0, \ldots$$

Clearly, the variable in this case tends to zero, though in the process
it takes exactly the value zero an infinite number of times. This does
not contradict the definition of a magnitude tending to zero.

Finally, suppose that all the successive values of a variable are
equal to zero. This also comes under the definition of a magnitude
tending to zero, all the more since $|x|$ is now zero all the time, i.e.
$|x| < \varepsilon$ for any given positive $\varepsilon$, not only from a certain initial point
of its variation, but always. In other words, a constant equal to zero
comes under the definition of an infinitesimal. No other constant
whatever comes under the definition.

There is one further point. We recall the definition of infinitesimal:
for any given positive $\varepsilon$, there exists a value of the variable x,
such that for all subsequent values, $|x| < \varepsilon$. It follows immediately,
that in proving that a given variable x tends to zero, we can confine
ourselves to considering only those values of x that succeed a
certain definite value of x, where this definite value can be chosen
arbitrarily.

Concerning this, it is useful in the theory of limits to add a rider
to the definition of a bounded magnitude, viz. there is no need to
demand that $|y| < M$ for all values of y; it is sufficient to take the
more general definition: a magnitude y is said to be bounded, if there
exists a positive number M and a value of y, such that $|y| < M$ for all
subsequent values.

The proof of the second property of infinitesimals remains
unchanged with this definition of a bounded magnitude. For
an enumerated variable, the first definition of a bounded magnitude
follows from the second, so that for such variables the second is not
in fact more general. Indeed, if $|x_n| < M$ for n > N, then denoting by $M'$ the
greatest of numbers

$$|x_1|, |x_2|, \ldots, |x_N| \text{ and } M,$$

we can assert that $|x_n| < M' + 1$ for any n.


27. The limit of a variable. We have called a variable an infinitesimal,
if its corresponding point K on the axis OX has on displacement
the following property: on successive variation the length of
the interval OK becomes, and on further variation remains, less
than any given positive number ε. We now suppose that this property
is fulfilled, not by the interval OK, but by AK, where A
is a definite point on the axis OX with abscissa a (Fig. 42). In this
case, the interval S'S of length 2ε will have its centre at the point
A, abscissa x = a, instead of at the origin, and the point K must
come within this interval on successive displacement, then remain
there on further displacement. We say in this case that the constant
a is the limit of the variable x, or that x tends to a.

[Fig. 42]


Noting that the length of AK is |x − a| [9], we can formulate
the following definition:

DEFINITION. The constant a is called the limit of the variable x,
when the difference a − x (or x − a) is an infinitesimal.

Having regard to the definition of an infinitesimal, a limit can be
thus defined:

DEFINITION. The constant a is called the limit of the variable x,
when we have the following property: for any given positive ε there
exists a value of x such that, for all subsequent values, |a − x| < ε.

We note some immediately obvious consequences of this definition, 
without dwelling on their detailed proof. 

No variable can tend to two different limits, and not every variable
has a limit. For example, the variable sin α oscillates between −1
and 1 on successive increase of the angle α, and has no limit.

The limit of an infinitesimal is zero.

If x and y vary simultaneously, and each tends to a limit in the course
of successive variation, whilst both always satisfy x ≤ y, their limits
a and b satisfy the condition a ≤ b.

We note here, that if the variables satisfy x < y, the sign of equality
can be obtained for their limits, i.e. we have a ≤ b.

If x, y, z vary simultaneously and always satisfy the condition
x ≤ y ≤ z on successive variation, and if x and z tend to the same limit
a, y also tends to the limit a.

If a is the limit of x (or x tends to a), we write:

$$\lim x = a, \quad \text{or} \quad x \to a.$$

If x tends to a, the difference x − a = α is an infinitesimal, and
we can write:

$$x = a + \alpha \tag{2}$$

i.e. every variable tending to a limit can be expressed as the sum of two
terms: a constant term, equal to the limit, and an infinitesimal.
Conversely, if a variable x can be expressed in the form (2), where a is a
constant, and α is an infinitesimal, the difference x − a will be an
infinitesimal, and hence, a is the limit of x.

If the sequence x_1, x_2, ... tends to the limit a, every infinite
subsequence x_{n_1}, x_{n_2}, ... contained in the first sequence also tends
to a. In this subsequence, the subscript n_k increases with increasing k
and runs through some part of the set of positive integers. There is no
analogous property, generally speaking, for a non-enumerated variable
tending to a limit.

We take as an example the variable x with the sequence of values:

$$x_1 = 0.1,\quad x_2 = 0.11,\quad x_3 = 0.111,\quad \dots,\quad x_n = 0.\underbrace{11\ldots11}_{n},\quad \dots,$$

and we show that its limit is 1/9. We first form the difference 1/9 − x_n:

$$\frac{1}{9} - x_1 = \frac{1}{90},\quad \frac{1}{9} - x_2 = \frac{1}{900},\quad \frac{1}{9} - x_3 = \frac{1}{9000},\quad \dots,\quad \frac{1}{9} - x_n = \frac{1}{9 \cdot 10^{n}}.$$

The condition:

$$\frac{1}{9 \cdot 10^{n}} < \varepsilon$$

is evidently equivalent to the condition:

$$9 \cdot 10^{n} > \frac{1}{\varepsilon} \quad \text{or} \quad n > \log_{10} \frac{1}{\varepsilon} - \log_{10} 9,$$

and we can take N as the greatest integer contained in the difference
log_10 (1/ε) − log_10 9. In this example, the difference 1/9 − x_n is a positive
number for every n, i.e. x tends to the limit 1/9 whilst always remaining
less than it.
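
The choice of N in this example can be checked numerically. The following
Python sketch is an illustration only: the names `x_n` and `threshold_N`
are our own, exact rational arithmetic from the standard `fractions` module
is assumed so that rounding plays no part, and N is found by direct search
rather than from the logarithms.

```python
from fractions import Fraction

def x_n(n: int) -> Fraction:
    """n-th term 0.11...1 (n ones after the point) as an exact fraction."""
    return Fraction(10**n - 1, 9 * 10**n)

def threshold_N(eps: Fraction) -> int:
    """Smallest N >= 0 such that 1/9 - x_n < eps for every n > N."""
    n = 1
    # 1/9 - x_n equals 1/(9*10**n), which decreases as n grows,
    # so the first n that satisfies the condition settles the matter.
    while Fraction(1, 9 * 10**n) >= eps:
        n += 1
    return n - 1

eps = Fraction(1, 1000)
N = threshold_N(eps)
print(N, [float(Fraction(1, 9) - x_n(n)) for n in range(N, N + 3)])
```

For ε = 1/1000 the search agrees with the greatest integer contained in
log_10 1000 − log_10 9.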

We now consider the sum of the first n members of the indefinitely
diminishing geometrical progression:

$$s_n = b + bq + bq^2 + \dots + bq^{n-1} \qquad (0 < |q| < 1).$$

As we know,

$$s_n = \frac{b(1 - q^n)}{1 - q}.$$

Setting n = 1, 2, 3, ..., we obtain the sequence:

$$s_1,\ s_2,\ s_3,\ \dots,\ s_n,\ \dots$$

We have from the expression for s_n:

$$\frac{b}{1 - q} - s_n = \frac{b q^n}{1 - q}.$$

The right-hand side consists of the product of a constant b/(1 − q)
and an infinitesimal q^n [26]. Hence, using the second property of
infinitesimals [26], the difference b/(1 − q) − s_n is an infinitesimal, and
we can say that the constant b/(1 − q) is the limit of the sequence
s_1, s_2, ..., s_n, ...

Suppose that b > 0 and q < 0. The difference b/(1 − q) − s_n is now
positive for even n and negative for odd n, so that the variable s_n
is alternately greater than, and less than, the limit to which it tends.
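
The behaviour of s_n for negative q can be seen in a small computation.
The Python sketch below is illustrative: the function names and the sample
values b = 1, q = −1/2 are our own choices, and exact fractions are used
so that the comparison with bq^n/(1 − q) is an equality and not an
approximation.

```python
from fractions import Fraction

def partial_sum(b: Fraction, q: Fraction, n: int) -> Fraction:
    """s_n = b + b*q + ... + b*q**(n-1), summed term by term."""
    return sum((b * q**k for k in range(n)), Fraction(0))

def gap(b: Fraction, q: Fraction, n: int) -> Fraction:
    """b/(1-q) - s_n, which the text shows to equal b*q**n/(1-q)."""
    return b / (1 - q) - partial_sum(b, q, n)

b, q = Fraction(1), Fraction(-1, 2)
# True where the limit exceeds s_n, False where s_n exceeds the limit.
signs = [gap(b, q, n) > 0 for n in range(1, 7)]
print(signs)
```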

The same remarks apply in the case of magnitudes that tend to a 
given limit as were made in the previous paragraph, apropos mag- 
nitudes that tend to zero. 

Any constant, equal to the number a, comes under the definition 
of a variable, tending to the limit a. We note here, that a magnitude, 
all of whose values are equal to a, has in the ordinary way an infinite 
set of values, though all these values are equal to the same number. 
This view of a constant as a particular case of a variable comes in
useful later on.

Furthermore, there is no need to consider all the values of a variable
x when defining its limit; we need only take values subsequent to
some arbitrarily given value.

Another point: if a variable x tends to a limit a, it will differ from a 
by as little as is desired, after a certain initial moment of its varia- 
tion, and hence it is all the more a bounded variable. 

An ordered variable does not always have a limit, as already mentioned.
If we take, for example, the enumerated variable x_1 = 0.1;
x_2 = 0.11; x_3 = 0.111; ..., whose limit is 1/9, and the variable y_1 = 1/2;
y_2 = 1/2^2; y_3 = 1/2^3; ..., whose limit is zero, the enumerated variable
z_1 = 0.1; z_2 = 1/2; z_3 = 0.11; z_4 = 1/2^2; z_5 = 0.111; z_6 = 1/2^3; ...
does not tend to a limit. The sequence of its values z_1, z_3, z_5, ... has
the limit 1/9, and the sequence z_2, z_4, z_6, ... has the limit zero.
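
A short computation makes the two subsequences visible. This Python sketch
is only an illustration; the function name `z` and the indexing from
n = 1 are our own conventions.

```python
from fractions import Fraction

def z(n: int) -> Fraction:
    """n-th term (n >= 1) of 0.1, 1/2, 0.11, 1/2**2, 0.111, 1/2**3, ..."""
    k = (n + 1) // 2
    if n % 2 == 1:
        # odd places: 0.11...1 with k ones
        return Fraction(10**k - 1, 9 * 10**k)
    # even places: 1/2**k
    return Fraction(1, 2**k)

odd_gap = abs(z(41) - Fraction(1, 9))   # distance of an odd-place term from 1/9
even_gap = abs(z(40))                   # distance of an even-place term from 0
print(float(odd_gap), float(even_gap), float(z(41) - z(40)))
```

Neighbouring terms stay about 1/9 apart however far we go, which is why
the combined sequence has no limit.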


28. Basic theorems. 1. The limit of the algebraic sum of a finite 
number of variables is equal to the sum of their limits. 

For the sake of exactness let us take the algebraic sum x − y + z
of three simultaneously varying magnitudes. We suppose that x, y
and z tend respectively to limits a, b and c. We show that the sum
tends to the limit a − b + c.

We have by hypothesis [27]:

$$x = a + \alpha,\quad y = b + \beta,\quad z = c + \gamma,$$

where α, β, γ are infinitesimals. We can write for the sum:

$$x - y + z = (a + \alpha) - (b + \beta) + (c + \gamma) = (a - b + c) + (\alpha - \beta + \gamma).$$

The first bracket on the right-hand side of this equation is a constant,
and the second is an infinitesimal [26]. Hence:

$$\lim (x - y + z) = a - b + c = \lim x - \lim y + \lim z.$$


2. The limit of the product of a finite number of variables is equal to 
the product of their limits. 

We confine ourselves to the case of the product xy of two variables. 
We suppose that x and y vary simultaneously, tending respectively 
to limits a and b, and we show that xy tends to the limit ab. 

We have by hypothesis:

$$x = a + \alpha,\quad y = b + \beta,$$

where α and β are infinitesimals; hence:

$$xy = (a + \alpha)(b + \beta) = ab + (a\beta + b\alpha + \alpha\beta).$$

Using both of the properties of infinitesimals from [26], we see that
the sum in the bracket on the right of this equation is an infinitesimal,
and hence we have:

$$\lim (xy) = ab = \lim x \cdot \lim y.$$


3. The limit of a quotient is equal to the quotient of the limits, provided 
the limit of the denominator is not zero. 

We take the quotient x/y, and suppose that x and y tend simultaneously
to their respective limits a and b, where b ≠ 0. We show
that x/y tends to a/b.

To prove the theorem, it is sufficient to show that the difference
a/b − x/y is an infinitesimal. By hypothesis:

$$x = a + \alpha;\quad y = b + \beta \qquad (b \ne 0),$$

where α and β are infinitesimals. Hence:

$$\frac{a}{b} - \frac{x}{y} = \frac{1}{b(b + \beta)}\,(a\beta - b\alpha).$$

The denominator of the fraction on the right of this equation is the
product of two factors, and tends to b². Thus, from some initial moment
of its variation, it is greater than b²/2, the fraction as a whole being
included between zero and 2/b², i.e. the fraction is bounded. The term
(aβ − bα) is an infinitesimal. Hence [26], the difference a/b − x/y is
an infinitesimal, and

$$\lim \frac{x}{y} = \frac{a}{b} = \frac{\lim x}{\lim y}.$$
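
The identity on which this proof rests can be verified exactly for sample
values. In the Python sketch below the function name and the particular
choices a = 3, b = −2, α = 1/10^k, β = −1/2^k are our own illustrative
assumptions; any infinitesimals with b + β ≠ 0 would serve.

```python
from fractions import Fraction

def quotient_gap(a, b, alpha, beta):
    """Return both sides of a/b - x/y = (a*beta - b*alpha) / (b*(b + beta))
    for x = a + alpha, y = b + beta."""
    x, y = a + alpha, b + beta
    return a / b - x / y, (a * beta - b * alpha) / (b * (b + beta))

a, b = Fraction(3), Fraction(-2)
gaps = [quotient_gap(a, b, Fraction(1, 10**k), Fraction(-1, 2**k))
        for k in range(1, 8)]
print([float(lhs) for lhs, _ in gaps])
```

The printed differences shrink towards zero as α and β do, in agreement
with the theorem.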


The theorems proved are of fundamental importance in the theory
of limits. The proofs have been given for the general case, and not for
the case of enumerated variables, as when proving the properties of
infinitesimals. But the remark we made when proving the first property
of infinitesimals should be borne in mind. Take the case of a
product. We take x and y as functions of some ordered variable t:
x = x(t); y = y(t). Then x and y are themselves ordered variables. The
same can be said of their product: w(t) = x(t) · y(t). The subscript
plays the part of t in enumerated variables, increasing through integral
values.

We remark further, that the above theorems establish the existence
of the limit of a sum, a product and a fraction. For example, the third
theorem can be stated more fully as: if numerator and denominator
tend to limits, and the limit of the denominator differs from zero,
the quotient then tends to a limit, and this limit is the quotient of the
limits of numerator and denominator.

We note some consequences of these theorems. If x tends to the
limit a, then bx^k, where b is a constant and k a positive integer, tends
to the limit ba^k, in accordance with Theorem 2.

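Consider the integral polynomial

$$f(x) = a_0 x^m + a_1 x^{m-1} + \dots + a_{m-1} x + a_m,$$

with constant coefficients a_k. Using Theorem 1 and the previous
remark, we can say that this polynomial tends to a limit:

$$\lim f(x) = f(a) \ \text{as } x \text{ tends to } a, \ \text{where} \ f(a) = a_0 a^m + a_1 a^{m-1} + \dots + a_{m-1} a + a_m. \tag{3}$$

Similarly, as x tends to a, the rational fraction:

$$\varphi(x) = \frac{a_0 x^m + a_1 x^{m-1} + \dots + a_{m-1} x + a_m}{b_0 x^p + b_1 x^{p-1} + \dots + b_{p-1} x + b_p}$$

tends to a limit:

$$\lim \varphi(x) = \varphi(a) = \frac{a_0 a^m + a_1 a^{m-1} + \dots + a_{m-1} a + a_m}{b_0 a^p + b_1 a^{p-1} + \dots + b_{p-1} a + b_p}, \tag{4}$$

if b_0 a^p + b_1 a^{p−1} + ... + b_{p−1} a + b_p ≠ 0.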


All these remarks are valid, in whatever way x tends to its limit a.

We can of course take polynomials arranged in powers of several 
variables, all tending to limits, instead of polynomials arranged in 
powers of a single variable. 

For example, if lim x = a and lim y = b, then
lim (x² + xy + y²) = a² + ab + b².


29. Infinitely large magnitudes. If the variable x tends to a limit, 
it is evidently bounded, as already remarked. We now consider some 
cases of variation of unbounded magnitudes. 

As before, we shall take along with x its corresponding point K,
displaced on the axis OX. Let the point K move in such a way that,
however large an interval T'T we take, with the origin as centre, the
point K will eventually be displaced outside it, and from then on will
remain outside. In this case, x is an infinitely large magnitude, and
tends to infinity. Let 2M be the length of the interval T'T. Recalling
that the length of the interval OK = |x|, we can give the following
definition:

The magnitude x is said to be infinitely large, or to tend to infinity, 
if on successive variation of x, |x| becomes, and on further variation 
remains, greater than any given positive number M. In other words, 
the magnitude x is called infinitely large if it satisfies the following 
condition: given any positive number M, there exists a value of x such 
that, for all subsequent x, |x| > M. 

In particular, if x is infinitely large, and always remains positive
during its successive variation as from a certain initial value (point K
to the right of O), we say that x tends to plus infinity (+∞). Similarly,
if x remains negative (point K to the left of O), we say that x tends
to minus infinity (−∞).

The following symbols are used for infinitely large magnitudes:

$$\lim x = \infty,\quad \lim x = +\infty,\quad \lim x = -\infty,$$

or

$$x \to \infty,\quad x \to +\infty,\quad x \to -\infty.$$


The term “infinitely large” serves merely as a brief designation for
the character of variation described above of the variable x, and here,
as with the concept of infinitesimal, a distinction must be made between
the concepts of “infinitely large” and “very large” magnitudes.

If, for example, x takes the sequence of values 1, 2, 3, ... then
evidently, x → +∞. If its sequence of values is: −1, −2, −3, ...,
then x → −∞. And finally, if the sequence is: −1, 2, −3, 4, ...,
we can write: x → ∞.

Let us take as a further example the magnitude with the sequence
of values:

$$q,\ q^2,\ q^3,\ \dots,\ q^n,\ \dots \qquad (q > 1), \tag{5}$$

and let M be any given positive number. The condition

$$q^n > M$$

is equivalent to

$$n > \frac{\log_{10} M}{\log_{10} q},$$

and hence, if N is the greatest integer contained in the quotient
log_10 M / log_10 q, we have:

$$q^n > M \quad \text{for} \quad n > N,$$

i.e. the variable in question tends to +∞.

If q is replaced by (−q) in the sequence (5), the only change is in
the signs of odd powers of q, the absolute values of the members of
the sequence remaining as before; thus, for negative q, with absolute
value greater than unity, the sequence (5) tends to infinity.
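
The number N of this example can also be found by direct search, which
avoids any rounding in the logarithms. The Python sketch below is
illustrative; the name `threshold_N` is our own, and the search compares
|q|^n with M exactly.

```python
from fractions import Fraction

def threshold_N(q: Fraction, M: Fraction) -> int:
    """Smallest N >= 0 with |q|**n > M for all n > N (requires |q| > 1)."""
    if abs(q) <= 1:
        raise ValueError("the sequence tends to infinity only for |q| > 1")
    n, power = 1, abs(q)
    # |q|**n increases with n, so the first n with |q|**n > M is decisive.
    while power <= M:
        n += 1
        power *= abs(q)
    return n - 1

print(threshold_N(Fraction(2), 1000))   # compare with log10(1000)/log10(2) = 9.96...
print(threshold_N(Fraction(-3), 80))    # the sign of q plays no part
```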

When in future we say that a variable tends to a limit, a finite limit 
is to be understood. It is occasionally said that a variable “tends to 
an infinite limit”, implying by these words an infinitely large magni- 
tude. 

An immediate consequence of the above definitions is: if variable
x tends to zero, then m/x, where m is a given constant, differing from
zero, tends to infinity; and if x tends to infinity, m/x tends to zero.


30. Monotonic variables. The important thing is often to show that
a given variable tends to a limit, without necessarily being able to
discover what this limit actually is. We now outline an important
test for the existence of a limit.

[Fig. 43]

We suppose that the variable x is always increasing (more precisely,
never decreasing) or else always decreasing (more precisely, never
increasing). In the first case, any given value is not less than all
preceding values, and not greater than all subsequent values. In the
second case, any given value is not greater than all preceding, and not
less than all succeeding, values. We speak of monotonic variation in
these cases.

Point K on the axis OX, corresponding to x, is now displaced in
a single direction, positively, if x increases, and negatively, if x
decreases. It is obvious at once that only two possibilities can arise:
either K moves away indefinitely along the line (x → +∞ or −∞), or K
indefinitely approaches some definite point A (Fig. 43), i.e. x tends
to a limit. If x is known to be bounded, as well as varying monotonically,
the first possibility drops out, and it can be asserted that the
variable tends to a limit.

This argument is based on intuition, and evidently lacks the force 
of a proof. We shall give the rigorous proof later. 

The above test for the existence of a limit is usually formulated as
follows: if a variable is bounded and varies monotonically, it tends to a
limit.

Take the example of the sequence:

$$u_1 = \frac{x}{1},\quad u_2 = \frac{x^2}{2!},\quad u_3 = \frac{x^3}{3!},\quad \dots,\quad u_n = \frac{x^n}{n!},\quad \dots \tag{6}$$

where x is a given positive number.
We have:

$$u_n = u_{n-1}\,\frac{x}{n}. \tag{7}$$

For n > x, x/n is less than unity, and u_n < u_{n−1}, i.e. from some
initial value, u_n is always decreasing for n increasing, whilst remaining
greater than zero. The variable thus tends to some limit u, in accordance
with the test for the existence of a limit. Let the integer n increase
indefinitely in equation (7). We obtain in the limit:

$$u = u \cdot 0 \quad \text{or} \quad u = 0,$$

i.e.

$$\lim_{n \to +\infty} \frac{x^n}{n!} = 0. \tag{8}$$

If we replace x by (−x) in sequence (6), the only change is in the
sign of members with odd n, so that the new sequence also tends to
zero, i.e. equation (8) is valid for any given x, positive or negative.
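
The recurrence (7) lends itself to direct computation. The Python sketch
below is illustrative only: the name `u_terms` and the sample value x = 5
are our own, and the auxiliary starting value u_0 = 1 is introduced merely
to start the recurrence.

```python
from fractions import Fraction

def u_terms(x: Fraction, count: int) -> list:
    """First `count` terms u_n = x**n / n!, built from u_n = u_(n-1) * x / n."""
    terms, u = [], Fraction(1)          # u_0 = 1 starts the recurrence
    for n in range(1, count + 1):
        u = u * x / n
        terms.append(u)
    return terms

terms = u_terms(Fraction(5), 40)
print([float(t) for t in terms[:8]], float(terms[-1]))
```

The terms first grow, then decrease steadily once n exceeds x, as the
argument above requires.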

We obtain the limit in this example, after first showing that it
exists. If we did not show its existence, our method could lead to a
false result. Consider, for instance, the sequence:

$$u_1 = q,\quad u_2 = q^2,\quad \dots,\quad u_n = q^n,\quad \dots \qquad (q > 1).$$

We have obviously:

$$u_n = u_{n-1}\, q.$$

We denote the limit of u_n by u, without troubling about its existence.
On transition to the limit in the above equation, we obtain:

$$u = uq, \quad \text{i.e.} \quad u(1 - q) = 0,$$

and hence,

$$u = 0.$$

But this result is false, since we know that for q > 1, lim q^n = +∞ [29].


† The symbol n!, called “factorial n”, is short for the product 1·2·3 ... n.


31. Cauchy’s test for the existence of a limit. The test for the
existence of a limit given in [30] is a sufficient, but not a necessary
condition that a limit exists, since, as we know [27], a variable can
tend to a limit without varying monotonically.

The French mathematician Cauchy gave a necessary and sufficient
condition for the existence of a limit, which we shall now formulate.
If the limit is known, it is characterized by the fact that, starting with
a certain value of the variable, the absolute value of the difference
between the limit and the variable is less than any given positive ε.
According to Cauchy's test, a necessary and sufficient condition for
a limit to exist is that, starting from a certain value of the variable,
the absolute value of the difference between any two subsequent values
of the variable is less than any given positive ε. We formulate this
rigorously:

CAUCHY'S TEST. A necessary and sufficient condition for a variable x
to have a limit is that, given any positive number ε, there exists a value
of x such that, for any two subsequent values x' and x'', we have
|x' − x''| < ε.

Suppose that we have the enumerated variable

$$x_1,\ x_2,\ \dots,\ x_n,\ \dots$$

According to Cauchy's test, a necessary and sufficient condition for
this sequence to have a limit is that, given any positive ε, there exists
an N (depending on ε) such that

$$|x_m - x_n| < \varepsilon \quad \text{for } m \text{ and } n > N. \tag{9}$$

It is easy to show that this condition is necessary. If our sequence
has the limit a, we write x_m − x_n = (x_m − a) + (a − x_n), whence it
follows:

$$|x_m - x_n| \le |x_m - a| + |a - x_n|.$$

But, by definition of a limit, there exists N such that |x_m − a| < ε/2
and |a − x_n| < ε/2 for m and n > N, and therefore |x_m − x_n| < ε
for m and n > N. To put the matter briefly, values of x lying arbitrarily
close to a lie arbitrarily close to each other.

We avoid a rigorous proof at present of the sufficiency of Cauchy's
test and give a descriptive explanation instead (Fig. 44).

Let M_s be a point of the coordinate axis corresponding to the
number x_s. Suppose that condition (9) is fulfilled. In accordance with
this condition, there exists a value N = N_1 such that

$$|x_s - x_{N_1}| < 1$$

for s > N_1, i.e. every point M_s, where s > N_1, lies inside the interval
A'_1A_1, the length of which is equal to two and the mid-point of which
corresponds to x_{N_1}.
Similarly, there exists a value N = N_2 > N_1, such that

$$|x_s - x_{N_2}| < \frac{1}{2} \quad \text{for} \quad s > N_2.$$

We construct an interval, of length unity, with mid-point M_{N_2};
and we let A'_2A_2 be the part of this interval belonging to A'_1A_1.
By virtue of the two conditions above, the point M_s must lie inside
interval A'_2A_2 for s > N_2.

Similarly there exists N = N_3 > N_2 such that |x_s − x_{N_3}| < 1/3
for s > N_3. We proceed as before, and construct A'_3A_3, with length
not exceeding 2/3 and belonging to A'_2A_2, all points M_s being
interior points of it for s > N_3. Setting ε = 1/4, 1/5, ..., 1/n, ..., we
obtain in this way a sequence of intervals A'_nA_n, each successive member
of which is comprised in the previous member, whilst the length of
the members tends to zero. The ends of these intervals obviously tend
to the same point A, and the number a corresponding to this point
is the limit of the variable x, since it follows from the construction
described above that, for a sufficiently large value of s, all the points
M_s will lie as close as desired to the point A.














As an application of Cauchy’s test, we take Kepler’s equation, which
defines the position of a planet in its orbit. This equation has the form:

$$x = q \sin x + a,$$

where a and q are given numbers, both lying between zero and unity, and
x is unknown.
We take an arbitrary x_0 and construct a sequence of numbers:

$$x_1 = q \sin x_0 + a,\quad x_2 = q \sin x_1 + a,\quad \dots,\quad x_n = q \sin x_{n-1} + a,\quad x_{n+1} = q \sin x_n + a,\quad \dots$$
Subtracting the first equation from the second term by term, we obtain:

$$x_2 - x_1 = q(\sin x_1 - \sin x_0) = 2q \sin\frac{x_1 - x_0}{2} \cos\frac{x_1 + x_0}{2}.$$

Noting that |sin α| ≤ |α| and |cos α| ≤ 1, we have:

$$|x_2 - x_1| \le 2q\,\frac{|x_1 - x_0|}{2} = q\,|x_1 - x_0|. \tag{10}$$

We can find in precisely the same way that

$$|x_3 - x_2| \le q\,|x_2 - x_1|,$$

so that, using (10), we can write:

$$|x_3 - x_2| \le q^2\,|x_1 - x_0|.$$

Proceeding in this manner, we obtain for every n the condition:

$$|x_{n+1} - x_n| \le q^n\,|x_1 - x_0|. \tag{11}$$

We now consider the difference x_m − x_n, taking m > n for the sake of
clarity:

$$x_m - x_n = (x_m - x_{m-1}) + (x_{m-1} - x_{m-2}) + (x_{m-2} - x_{m-3}) + \dots + (x_{n+1} - x_n).$$

Using (11), and the formula for the sum of the terms of a geometrical
progression, we may write:

$$|x_m - x_n| \le |x_m - x_{m-1}| + |x_{m-1} - x_{m-2}| + \dots + |x_{n+1} - x_n| \le (q^{m-1} + q^{m-2} + \dots + q^n)\,|x_1 - x_0| = q^n\,\frac{1 - q^{m-n}}{1 - q}\,|x_1 - x_0|.$$

As n tends to infinity, q^n tends to zero [26]; |x_1 − x_0| is constant; the
fraction (1 − q^{m−n})/(1 − q) always lies between zero and 1/(1 − q), i.e. is
bounded, since, for m > n, q^{m−n} lies between zero and unity. Thus, with
indefinite increase of n, and any m > n, the difference x_m − x_n tends to
zero, and condition (9) is fulfilled. We can say, in accordance with
Cauchy’s test, that a limit exists:

$$\lim_{n \to +\infty} x_n = \xi.$$


We let n tend to infinity in the equation

$$x_{n+1} = q \sin x_n + a.$$

Using the continuity of the function sin x,† we find in the limit:

$$\xi = q \sin \xi + a, \tag{12}$$

i.e. the limit ξ of the variable x_n is also a root of Kepler's equation.

We started with an arbitrary x_0 in constructing the sequence x_n. We show,
however, that Kepler's equation does not possess two roots, i.e. that
lim x_n = ξ is independent of the choice of x_0 and is equal to the single
root of Kepler’s equation.


† The definition of continuity is given below [34].

We assume there is a root ξ_1 in addition to the root ξ, so that:

$$\xi_1 = q \sin \xi_1 + a.$$

Subtracting equation (12) term by term from this equation, we obtain:

$$\xi_1 - \xi = q(\sin \xi_1 - \sin \xi) = 2q \sin\frac{\xi_1 - \xi}{2} \cos\frac{\xi_1 + \xi}{2},$$

whence, as before,

$$|\xi_1 - \xi| \le q\,|\xi_1 - \xi|.$$

But q lies between zero and unity, so that the above relationship is only
possible for ξ_1 − ξ = 0, i.e. ξ_1 = ξ, and hence Kepler’s equation has only
one root ξ.
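
The successive approximations to the root of Kepler's equation are easily
computed. The following Python sketch is an illustration under our own
assumptions: the function name `kepler_iterates`, the sample values
q = 0.5, a = 0.8 and the number of steps are hypothetical choices, and
ordinary floating-point arithmetic is used, so the checks hold only up to
rounding.

```python
import math

def kepler_iterates(q: float, a: float, x0: float, steps: int) -> list:
    """Return [x_0, x_1, ..., x_steps] with x_(n+1) = q*sin(x_n) + a."""
    xs = [x0]
    for _ in range(steps):
        xs.append(q * math.sin(xs[-1]) + a)
    return xs

q, a = 0.5, 0.8
xs = kepler_iterates(q, a, x0=0.0, steps=60)
root = xs[-1]

# A different starting value should lead to the same root.
other = kepler_iterates(q, a, x0=3.0, steps=60)[-1]
print(root, other, root - (q * math.sin(root) + a))
```

The differences of neighbouring approximations decrease at least as fast
as q^n |x_1 − x_0|, in accordance with (11), and the result does not
depend on the starting value.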


32. Simultaneous variation of two variables, connected by a functional 
relationship. We consider two variables x and y, connected by the 
functional relationship: 


$$y = f(x),$$


and we let f(x) be defined to the left and right of the point x = c. 
We shall assume that x increases and passes through all real values 
as it tends to c, without in fact reaching c. In this case, f(x) is an ordered 
variable. We suppose that it has a limit A. 
This is usually written as follows:

$$\lim_{x \to c-0} y = \lim_{x \to c-0} f(x) = A, \tag{13}$$

where the symbol x → c − 0 indicates that x tends to c from the side
of lower values.
Similarly, if x tends to c whilst diminishing and passing through all
real values, and if f(x) now tends to the limit B, we write this as:

$$\lim_{x \to c+0} y = \lim_{x \to c+0} f(x) = B. \tag{14}$$

The existence of the limit (13) is evidently equivalent to f(x) coming
as close as desired to the number A, when x comes sufficiently close
to the number c, whilst remaining less than c, i.e. (13) is equivalent
to the following: for any given positive number ε there exists a positive
number η such that

$$|A - f(x)| < \varepsilon \quad \text{as soon as} \quad 0 < c - x < \eta.$$

Of course, η depends on ε.
In precisely the same way, (14) is equivalent to: for any given positive
number ε there exists a positive number η such that

$$|B - f(x)| < \varepsilon \quad \text{as soon as} \quad 0 < x - c < \eta.$$

If limits A and B are equal, we write this as follows:

$$\lim_{x \to c} y = \lim_{x \to c} f(x) = A. \tag{15}$$

It is immaterial here, whether x is on one side of c or the other, and
(15) implies: for any given positive ε there exists a positive η such that

$$|A - f(x)| < \varepsilon \quad \text{as soon as} \quad |c - x| < \eta \ \text{and} \ x \ne c. \tag{16}$$

Limit (13) is often denoted by the symbol f(c − 0) and limit (14)
by f(c + 0):

$$\lim_{x \to c-0} f(x) = f(c - 0); \qquad \lim_{x \to c+0} f(x) = f(c + 0).$$

Symbols f(c − 0) and f(c + 0) should be distinguished from f(c), i.e.
the value of f(x) for x = c. This latter value can differ from f(c − 0)
and f(c + 0), or in fact can be entirely meaningless. The limits f(c − 0)
and f(c + 0) exist in the case of functions having graphs with no
discontinuities, when we obviously have: f(c − 0) = f(c + 0) = f(c),
i.e.

$$\lim_{x \to c} f(x) = f(c).$$

We say in this case that the function f(x) is continuous for x = c
(at the point x = c). We shall consider the properties of continuous
functions in detail later.

We return to the general case. The above definitions are easily
generalized for the case when y tends to infinity. It is easy to see,
for example, on the basis of what has been said, that

$$\lim_{x \to c-0} \frac{1}{x - c} = -\infty; \qquad \lim_{x \to c+0} \frac{1}{x - c} = +\infty,$$

$$\lim_{x \to \frac{\pi}{2}-0} \tan x = +\infty; \qquad \lim_{x \to \frac{\pi}{2}+0} \tan x = -\infty.$$

Taking the principal values of the function y = arc tan x [24], we
can write:

$$\lim_{x \to c-0} \text{arc tan}\, \frac{1}{x - c} = -\frac{\pi}{2}; \qquad \lim_{x \to c+0} \text{arc tan}\, \frac{1}{x - c} = +\frac{\pi}{2}.$$

If f(x) is defined for all sufficiently large x, the limit can exist:

$$\lim_{x \to +\infty} f(x) = A.$$

If f(x) is defined for all x, either positive or negative, that are
sufficiently large in absolute value, the limit can exist:

$$\lim_{x \to \infty} f(x) = A.$$

The latter is equivalent to: for any given positive number ε there
exists a positive number M, such that

$$|A - f(x)| < \varepsilon \quad \text{for} \quad |x| > M.$$

The following equations may easily be verified:

$$\lim_{x \to +\infty} x^3 = +\infty; \qquad \lim_{x \to -\infty} x^3 = -\infty;$$

$$\lim_{x \to \infty} \frac{1}{x} = 0; \qquad \lim_{x \to \infty} x^2 = +\infty;$$
1 
lim cs lim a -— 
3aà?p- z-Ll — 1 1 3" 
xX on X-> w se. — 
ST x + x? 
345 
2 
lim AL. =lim 2 = 
x— o x-> 00 a 


We also take an example from physics. Suppose that we heat a
certain solid, and let t_0 be its initial temperature. The temperature
of the body rises on heating, until the melting point is reached. The
temperature now remains constant on further heating, till the point
when the whole of the substance has passed over to the liquid state;
after this, the temperature-rise begins again, in the resultant liquid.
The situation is similar on passage from the liquid to the gaseous
state. We shall consider the amount of heat Q communicated to the
substance as a function of the temperature. Figure 45 shows the graph
of this function, with temperature on the horizontal axis, and the
amount of heat absorbed on the vertical axis. Let t_1 be the temperature
at which transition to the liquid state begins, and t_2 the temperature
at which the transition from the liquid to the gaseous state begins.
Evidently:

$$\lim_{t \to t_1 - 0} Q = \text{ord. } AB \quad \text{and} \quad \lim_{t \to t_1 + 0} Q = \text{ord. } AC.$$

The size of the segment BC gives the latent heat of fusion, and
that of EF the latent heat of vaporization.

If limits f(c − 0) and f(c + 0) exist and differ, their difference
f(c + 0) − f(c − 0) is called the break, or jump, of function f(x) at
x = c (at the point x = c).

The function y = arc tan 1/(x − c) has a jump of π at x = c. The
function Q(t) just considered has a jump equal to the latent heat of
fusion at the melting-point t = t_1.
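
The jump of arc tan 1/(x − c) can be estimated numerically by approaching
c from each side. The Python sketch below is illustrative: the function
name `g`, the value c = 2 and the step sizes 10^(−k) are our own choices,
and floating-point arithmetic limits the accuracy of the estimate.

```python
import math

def g(x: float, c: float) -> float:
    """arc tan 1/(x - c), principal value; not defined at x = c."""
    return math.atan(1.0 / (x - c))

c = 2.0
left = [g(c - 10.0**-k, c) for k in range(1, 9)]    # x -> c - 0
right = [g(c + 10.0**-k, c) for k in range(1, 9)]   # x -> c + 0
jump_estimate = right[-1] - left[-1]
print(left[-1], right[-1], jump_estimate)
```

The values on the left approach −π/2, those on the right +π/2, and their
difference approaches π.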

In defining the limit of f(x) as x tends to c, we assumed that x
never actually coincides with c. This proviso is made, since the value
of f(x) for x = c sometimes either does not exist, or else has nothing
in common with the values of f(x) for x close to c. The function Q(t),
for example, is not defined for t = t_1.

[Fig. 45] [Fig. 46]

Another explanatory example may be given. We assume that a
function is defined as follows in the interval (−1, +1):

$$y = x + 1 \ \text{for} \ -1 < x < 0; \qquad y = x - 1 \ \text{for} \ 0 < x < 1; \qquad y = 0 \ \text{for} \ x = 0.$$

Figure 46 shows the graph of this function; it consists of two straight
sections, with their ends excluded (for x = 0), and a single isolated
point, the origin. We now have:


$$\lim_{x \to -0} f(x) = 1; \qquad \lim_{x \to +0} f(x) = -1; \qquad f(0) = 0.$$


33. Example. We consider an example that is important later on.
We take

$$y = \frac{\sin x}{x}.$$

This function is defined for all x, other than x = 0, for which both
numerator and denominator become zero, so that the fraction loses
its meaning. We shall see how y varies as x tends to zero. The magnitude
of the fraction does not change when x changes sign, so that it is
sufficient to find the limit of the fraction as x tends to zero through
positive values, i.e. in the first quadrant. This limit exists, as we
shall show. From the above remarks, the same limit is obtained
for x tending to zero through negative values. We note that the theorem
regarding the limit of a quotient cannot be used, since the denominator
tends to zero as x → 0.

[Fig. 47]

We shall take x as the angle subtended at the centre of a circle
of unit radius. Measuring angle in radians, we have (Fig. 47):

$$\sin x = AC,\quad x = \text{arc } AB,\quad \tan x = AD,$$

where AD is the tangent to the circle at the end of arc x.
Since the length of the arc is intermediate between the length of
the chord and the sum of the tangents, we can write:

$$2 \sin x < 2x < 2 \tan x,$$

whence, dividing by 2 sin x, we have:

$$1 < \frac{x}{\sin x} < \frac{1}{\cos x}$$

or

$$1 > \frac{\sin x}{x} > \cos x. \tag{17}$$

But as x tends to zero, cos x, given by the distance OC, evidently
tends to unity, i.e. the variable sin x/x always lies between unity and a
magnitude tending to unity, and hence [27]:

$$\lim_{x \to 0} y = \lim_{x \to 0} \frac{\sin x}{x} = 1.$$

We determine for this case the number η, encountered in condition (16).

Subtracting the three terms of (17) from unity, we have:

$$0 < 1 - \frac{\sin x}{x} < 1 - \cos x,$$

and this shows that

$$\left| 1 - \frac{\sin x}{x} \right| < \varepsilon \quad \text{if} \quad |1 - \cos x| < \varepsilon.$$

Recalling that the sine of an arc in the first quadrant is less than
the arc itself, we obtain:

$$1 - \cos x = 2 \sin^2 \frac{x}{2} < 2 \left( \frac{x}{2} \right)^2 = \frac{x^2}{2},$$

and it is sufficient to choose:

$$\frac{x^2}{2} < \varepsilon, \quad \text{i.e.} \quad |x| < \sqrt{2\varepsilon}.$$

Thus, √(2ε) can act as η in the given case.
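
The value of η just found can be tried out on sample points. The Python
sketch below is a numerical illustration, not a proof: the names `eta`
and `worst_gap` and the number of sample points are our own choices, and
only finitely many values of x in the interval 0 < |x| < √(2ε) are
examined.

```python
import math

def eta(eps: float) -> float:
    """The eta found in the text: sqrt(2*eps)."""
    return math.sqrt(2.0 * eps)

def worst_gap(eps: float, samples: int = 1000) -> float:
    """Largest |1 - sin(x)/x| over sample points with 0 < |x| < eta(eps)."""
    h = eta(eps)
    xs = [h * k / (samples + 1) for k in range(1, samples + 1)]
    return max(abs(1.0 - math.sin(s * x) / (s * x)) for x in xs for s in (1, -1))

for eps in (0.5, 0.01, 1e-4):
    print(eps, eta(eps), worst_gap(eps))
```

In each case the largest observed difference stays well below ε, since
the bound x²/2 used above is by no means the sharpest available.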


34. Continuity of functions. We have already introduced the 
definition of the continuity of a function at the point x — c, if the 
function is defined both at the point and in the vicinity to left and 
right. We give the definition again. 

DEFINITION. The function f(x) is said to be continuous for x = c (at the 
point x = c), if a limit of f(x) exists for x — c and if this limit is equal 
to f(c): l 

lim f(z) = f(c) = f (lim a). (18) 
x=>C xc 


We recall that this is equivalent to the fact that there exist limits 
f(c — 0) and f(c + 0) to left and right, and to the fact that these limits 
are equal to each other and to f(c), ie. f(e — 0) = f(e + 0) = f(c). 
Alternatively, the definition given above is equivalent, as we have 
seen [32], to: for any given positive e, there exists a positive y such that 


fo) — f(a) | < e for le — z| « v. (19) 


It may be remarked that, in view of the arbitrariness of the choice
of ε, we can write | f(c) − f(x) | ≤ ε in place of | f(c) − f(x) | < ε in
this definition. This remark applies to all previous similar definitions,
and in particular, to the definition of an infinitesimal and a limit,
as also to the following equivalent definition of continuity.

The difference x − c is the increment of the independent variable,
whilst f(x) − f(c) is the corresponding increment of the function,
so that the definition of continuity just given is equivalent to the
following: a function is said to be continuous at the point x = c, if to
an infinitesimal increment of the independent variable (from the ini-
tial value x = c) there corresponds an infinitesimal increment of the
function.

We note that the property of continuity, as expressed in equation 
(18), amounts to the possibility of finding the limit of the function 
by directly replacing the independent variable with its limit. 

We saw from formulae (3) and (4) [28], that polynomials in x and
the quotients of such polynomials, i.e. rational functions of x, are
functions continuous for any x, except those for which the denomin-
ator of the rational function becomes zero.

The function y = b is also obviously continuous, its value being 
the same for all x [12]. 

All the elementary functions, discussed in the first chapter (power, 
exponential, logarithmic, trigonometric and inverse circular), are 
continuous for all the x for which they exist, except those for which 
they tend to infinity. 

For example, log_10 x is a continuous function of x for all positive x;
tan x is a continuous function of x for all x, except

$$ x = (2k + 1) \frac{\pi}{2}, $$

where k is any integer.

Notice further the function u^v, where u and v are continuous func-
tions of x, u being assumed not to take negative values. This is also
called an exponential function. It likewise has the property of con-
tinuity, except for those x for which u and v are simultaneously zero
or u = 0 and v < 0.

We shall accept without proof what has been said about the con- 
tinuity of the elementary functions, although proof is of course requi- 
red, and can in fact be given with complete rigour. We shall later 
examine the question in detail. 

It can easily be shown that the sum or product of any finite number 
of continuous functions is itself a continuous function; the same is 
true of the quotient of two continuous functions except for those values of 
the independent variable for which the denominator tends to zero. 

We only consider the case of a quotient. We assume that functions
φ(x) and ψ(x) are continuous for x = a and that ψ(a) ≠ 0. We take
the function

$$ f(x) = \frac{\varphi(x)}{\psi(x)}. $$

Using the theorem concerning the limit of a quotient, we obtain:

$$ \lim_{x \to a} f(x) = \frac{\lim_{x \to a} \varphi(x)}{\lim_{x \to a} \psi(x)} = \frac{\varphi(a)}{\psi(a)} = f(a), $$

which proves the continuity of the quotient f(x) for x = a.

We note one simple example. If y = sin x is a continuous function 
of x, y = b sin x, where b is a constant, will also be continuous, being 
the product of the continuous functions y = b (see above) and
y = sin x.

We turn again now to the function y = (sin x)/x. This is not defined
for x = 0, but we know that lim y = 1 as x → 0. Hence, if we put y = 1 for
x = 0, y will be a continuous function at the point x = 0.


Such a process of finding the limit of a function for x tending to its 
point of indeterminacy is called disclosing the indeterminacy, and the 
limit itself, if it exists, is sometimes called a true value of the function 
at this point of indeterminacy. We shall have many examples later 
on of the disclosure of indeterminacies. 
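The completion of (sin x)/x at x = 0 described above can be written out directly. This is a small illustrative sketch; the function name y simply follows the notation of the text, and the sample points are arbitrary.

```python
import math

def y(x):
    """(sin x)/x, with the value 1 assigned at the point x = 0."""
    return 1.0 if x == 0 else math.sin(x) / x

# Values near zero approach the assigned value y(0) = 1.
for x in (0.1, 0.01, 0.001, 0.0):
    print(x, y(x))
```

Without the special case for x = 0 the expression would fail with a division by zero, which is the computational counterpart of the function being undefined at that point.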


35. The properties of continuous functions. We defined above the
continuity of a function for a given value of x. We now suppose that
the function is defined in a finite interval a ≤ x ≤ b. If it is continuous
for any given x in this interval, we say that it is continuous in the
interval (a, b). We note here that continuity of the function at the
ends of the interval, x = a and x = b, consists in:

$$ \lim_{x \to a+0} f(x) = f(a), \qquad \lim_{x \to b-0} f(x) = f(b). $$


All continuous functions have the following properties: 

1. If the function f(x) is continuous in the interval (a, b), there exists 
at least one value of x in this interval at which f(x) takes its maximum 
value, and at least one value of x for which the function takes its minimum 
value. 

2. If the function f(x) is continuous in the interval (a, b), with f(a) = m 
and f(b) = n, and if k is any number lying between m and n, there exists 
at least one x in the interval such that f(x) =k; and in particular, 
if f(a) and f(b) have opposite signs, there exists at least one x in the 
interval such that f(x) is zero. 
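The second property is what makes the method of repeated halving work for locating a zero. The sketch below is an illustration under stated assumptions, not part of the original text: the name bisect_root, the tolerance and the sample cubic are all chosen here for the example, and the function f is assumed continuous on the closed interval.

```python
def bisect_root(f, a, b, tol=1e-12):
    """Locate x in [a, b] with f(x) close to 0, assuming f is continuous
    and f(a), f(b) have opposite signs (property 2)."""
    fa, fb = f(a), f(b)
    if fa == 0:
        return a
    if fb == 0:
        return b
    if fa * fb > 0:
        raise ValueError("f(a) and f(b) must have opposite signs")
    while b - a > tol:
        mid = (a + b) / 2
        fm = f(mid)
        if fm == 0:
            return mid
        if fa * fm < 0:
            b = mid              # the sign change lies in [a, mid]
        else:
            a, fa = mid, fm      # the sign change lies in [mid, b]
    return (a + b) / 2

# Sample cubic: negative at x = 2, positive at x = 3.
root = bisect_root(lambda x: x**3 - 2*x - 5, 2.0, 3.0)
print(root)
```

At every step the function still has opposite signs at the ends of the retained half, so property 2 guarantees that a zero remains inside it.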

These two properties are immediately clear, if we note that the 
graph corresponding to a continuous function is a continuous curve. 
This remark cannot serve as a proof, of course. The concept itself of
a continuous curve, obvious at first sight, is seen to be unusually
complex on closer inspection. The rigorous proof of the two properties 
mentioned, as also of the third, to follow, is based on the theory of 
irrational numbers. We accept these properties without proof. 

In subsequent paragraphs of the present section, we study the basis 
of the theory of irrational numbers and the relationship of this theory 
to the theory of limits and to the properties of continuous functions. 





Fig. 48          Fig. 49


We may remark that the second property of continuous functions can 
also be formulated thus: on continuous variation of x from a to b, 
the continuous function f(x) passes at least once through every num- 
ber lying between f(a) and f(b). 

Figures 48 and 49 show the graphs of functions, continuous in the 
interval (a, b), for which f(a) < 0 and f(b) > 0. In Fig. 48 the graph 
cuts the axis OX once, and f(x) is zero for the corresponding x. There 
are three such values of x, instead of one, in the case of Fig. 49. 

We now pass to the third property of continuous functions, which 
is less obvious than the two previous ones. 

3. If f(x) is continuous in the interval (a, b), and if x = x_0 is a certain
value of x in this interval, by condition (19) [34] (replacing c by x_0),
for any given positive ε there exists an η, of course depending on ε,
such that

$$ | f(x) - f(x_0) | < \varepsilon, \quad \text{if} \quad | x - x_0 | < \eta, $$

it naturally being assumed that x also lies in this interval. (If, for
example, x_0 = a, x must be greater than a, and if x_0 = b, x < b.)
But the number η can depend, not only on ε, but also on just what
value of x = x_0 we take in the interval. The third property of con-
tinuous functions consists in the fact that, for any given ε, there
exists the same η for all x_0 in the interval (a, b). In other words,
if f(x) is continuous in the interval (a, b), for any given positive ε there
exists a positive η such that

$$ | f(x'') - f(x') | < \varepsilon \tag{20} $$

for any two values x'' and x' in the interval (a, b) which satisfy the in-
equality

$$ | x'' - x' | < \eta. \tag{21} $$


This property is referred to as uniform continuity. Thus, if a function 
is continuous in an interval (a, b), it is uniformly continuous in this 
interval. 

We again remark, that we assume f(x) to be continuous, not only 
for all x inside the interval (a, b), but also for x =a and x = b. 

We shall further illustrate the property of uniform continuity by
a simple example. We first rewrite the above inequality in another
form, replacing the symbol x' by x, and x'' by (x + h). Now x'' − x' = h
is the increment of the independent variable, and f(x + h) − f(x) is
the corresponding increment of the function. The property of uniform
continuity now becomes:

$$ | f(x + h) - f(x) | < \varepsilon \quad \text{if} \quad | h | < \eta, $$

where x and (x + h) are any two points in the interval (a, b).


Take the example of the function:

$$ f(x) = x^2. $$

We now have:

$$ f(x + h) - f(x) = (x + h)^2 - x^2 = 2xh + h^2. $$

For any given x, the expression (2xh + h²) for the increment of our function
obviously tends to zero, as the increment of the independent variable tends
to zero. This is a further confirmation (cf. [34]) that the function in question
is continuous for every x. It will be continuous, for instance, in the interval
−1 ≤ x ≤ 2. We show that it is uniformly continuous in this interval. We
have to satisfy the inequality:

$$ | 2xh + h^2 | < \varepsilon \tag{22} $$


for suitable choice of η in the inequality | h | < η, where x and (x + h)
must lie in the interval (−1, 2). We have:

$$ | 2xh + h^2 | \le | 2xh | + h^2 = 2 | x | | h | + h^2. $$

The maximum value of | x | in the interval is two, and hence we can rewrite
the above inequality with greater force as:

$$ | 2xh + h^2 | \le 4 | h | + h^2. $$

We shall always take | h | < 1. Then h² ≤ | h |, and we can write the above
in the form:

$$ | 2xh + h^2 | \le 4 | h | + | h | $$

or

$$ | 2xh + h^2 | \le 5 | h |. $$

The inequality (22) will certainly be satisfied, if we choose | h | subject to the
condition 5 | h | < ε. Thus, h must satisfy two inequalities:

$$ | h | < 1 \quad \text{and} \quad | h | < \frac{\varepsilon}{5}. $$

We can thus take for η the least of the two numbers 1 and ε/5. For small
ε (in fact, ε < 5), we must take η = ε/5, and this η is evidently the same,
for a given ε, for all x in the interval (−1, 2).
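The choice of η as the least of 1 and ε/5 can be tried out on random points of the interval. This is a hedged numerical sketch only: the names eta and holds_uniformly, the number of trials and the fixed random seed are assumptions made for the illustration, and random sampling cannot replace the proof given above.

```python
import random

def eta(eps):
    """The value found in the text: the least of 1 and eps / 5."""
    return min(1.0, eps / 5.0)

def holds_uniformly(eps, trials=100_000, seed=0):
    """Check |(x + h)^2 - x^2| < eps for random x, x + h in [-1, 2]
    with |h| < eta(eps); the same eta is used for every x."""
    rng = random.Random(seed)
    n = eta(eps)
    for _ in range(trials):
        x = rng.uniform(-1.0, 2.0)
        h = rng.uniform(-n, n)
        if abs(h) >= n or not (-1.0 <= x + h <= 2.0):
            continue                     # pair lies outside the hypotheses
        if abs((x + h) ** 2 - x ** 2) >= eps:
            return False
    return True

for eps in (10.0, 1.0, 0.1, 0.001):
    print(eps, eta(eps), holds_uniformly(eps))
```

The point of the example is that eta(eps) does not take x as an argument: one value serves the whole interval.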

The properties mentioned need not hold in the case of discontinuous func-
tions, or of those continuous only inside an interval. Take the function, the
graph of which is shown in Fig. 46. It is defined in the interval (−1, +1)
and has a discontinuity at x = 0. It has values as close as desired to unity,
but it does not take the value unity, or values greater than unity. There
is thus no maximum among the values of this function. Similarly, there is
no minimum. The elementary function y = x does not take either a maxi-
mum or minimum value inside the open interval (0, 1). If it is considered in the
closed interval (0, 1), it reaches its minimum value at x = 0, and its maximum
at x = 1. Take another function, f(x) = sin (1/x), continuous in the interval
0 < x ≤ 1, open on the left. As x tends to zero, the argument 1/x increases
indefinitely, and sin (1/x) oscillates between (−1) and (+1), having no
limit as x → +0. We show that this function is not uniformly continuous in
the interval 0 < x ≤ 1. We take two values: x' = 1/(nπ) and x'' = 2/((4n + 1)π),
where n is a positive integer. Both values lie in the interval for any choice
of n. Further, we have:

$$ f(x') = \sin n\pi = 0; \qquad f(x'') = \sin \left( 2n\pi + \frac{\pi}{2} \right) = 1. $$

Thus:

$$ f(x'') - f(x') = 1 $$

and

$$ x'' - x' = \frac{2}{(4n + 1)\pi} - \frac{1}{n\pi}. $$

As the positive integer n tends to infinity, the difference x'' − x' tends to
zero, whilst f(x'') − f(x') remains equal to 1. It is thus evident that there
does not exist a positive η, such that, in the interval 0 < x ≤ 1, (21) implies
| f(x'') − f(x') | < 1; this corresponds to choosing ε = 1 in formula (20).

Take the function f(x) = x sin (1/x). The first term of the product tends
to zero as x → +0, whilst the absolute value of the second, sin (1/x), does
not exceed unity; hence [32], f(x) → 0 as x → +0. The second term has
no meaning for x = 0, but if we complete the definition of our function by
taking f(0) = 0, i.e. if we take f(x) = x sin (1/x) for 0 < x ≤ 1 and f(0) = 0,
we obtain a function continuous in the closed interval (0, 1). The functions
sin (1/x) and x sin (1/x) are evidently continuous for any x, excepting zero.
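The failure of uniform continuity for sin (1/x) on the half-open interval can be observed numerically with the two sequences of points used above. The sketch is illustrative; the helper name sample_pair is an assumption, and the printed values are subject to ordinary floating-point rounding.

```python
import math

def f(x):
    return math.sin(1.0 / x)

def sample_pair(n):
    """The points x' = 1/(n pi) and x'' = 2/((4n + 1) pi) of the text."""
    x1 = 1.0 / (n * math.pi)
    x2 = 2.0 / ((4 * n + 1) * math.pi)
    return x1, x2

for n in (1, 10, 100, 1000):
    x1, x2 = sample_pair(n)
    # The distance between the points shrinks; the gap in f stays near 1.
    print(n, abs(x2 - x1), abs(f(x2) - f(x1)))
```

However small a positive η is proposed, a sufficiently large n gives two points closer together than η whose function values still differ by about 1.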


36. Comparison of infinitesimals and of infinitely large magnitudes.
If α and β are two magnitudes, simultaneously tending to zero, the
theorem regarding the limit of a quotient cannot be used for finding
the limit of the ratio β/α. We shall assume that the variables α and β,
whilst tending to zero, do not take the value zero. If the ratio β/α
tends to a finite limit, differing from zero, the ratio α/β will also
tend to a finite limit, differing from zero. We say in this case that
β and α are infinitesimals of the same order. If the ratio β/α has the
limit zero, we say that β is an infinitesimal of higher order in comparison
with α, or that α is an infinitesimal of lower order in comparison
with β. If the ratio β/α tends to infinity, α/β tends to zero, i.e. β
is of lower order compared with α, and α of higher order compared
with β. It is easy to show that, if α and β are infinitesimals of the
same order, and γ is an infinitesimal of higher order compared with
α, γ is also of higher order as regards β. By hypothesis γ/α → 0, and
α/β has a finite limit, differing from zero. From the self-evident equation
γ/β = (γ/α) · (α/β), and using the theorem regarding the limit of a pro-
duct, it follows at once that γ/β → 0, which proves our state-
ment.

We note an important particular case of infinitesimals of the same
order. If α/β → 1 (so that also β/α → 1), infinitesimals α and β are
referred to as equivalent. It follows at once from the equation

$$ \frac{\beta - \alpha}{\alpha} = \frac{\beta}{\alpha} - 1 $$

that the equivalence of α and β implies that the difference β − α is an
infinitesimal of higher order than α. It similarly follows from the
equation

$$ \frac{\beta - \alpha}{\beta} = 1 - \frac{\alpha}{\beta} $$

that their equivalence implies that β − α is an infinitesimal of higher
order than β.

If β/α^k, where k is a positive constant, tends to a finite limit, differ-
ing from zero, we say that β is an infinitesimal of order k with respect
to α. If β/α^k → c, where c is a number, not zero, β/(cα^k) → 1, i.e. β
and cα^k are equivalent infinitesimals, and therefore, γ = β − cα^k is
an infinitesimal of higher order than β (or than cα^k). If α is taken as
the basic infinitesimal, the equation β = cα^k + γ, where γ is an
infinitesimal of higher order than cα^k, represents the isolation from
the infinitesimal β of the infinitesimal term cα^k (of the simplest
form with respect to α), in such a way that the remainder is an
infinitesimal γ of higher order than β (or than cα^k).

An analogous comparison can be made of the infinitely large magni-
tudes u and v. If v/u tends to a limit, finite and not zero, we say
that u and v are infinitely large magnitudes of the same order. If
v/u → 0, then u/v → ∞. We say in this case that v is of a lower order
of greatness with respect to u, or that u is of a higher order of greatness
with respect to v. If v/u → 1, the infinitely large magnitudes are
said to be equivalent. If v/u^k, where k is a positive constant, has a
limit, which is finite and not zero, we say that v is of the kth order
of greatness with respect to u. All the above remarks about infinitesi-
mals apply for infinitely large magnitudes.

We further remark, that if the ratio β/α or v/u has no limit at all,
the corresponding infinitesimals or infinitely large magnitudes are said
to be incomparable.

1. We saw above that

$$ \lim_{x \to 0} \frac{\sin x}{x} = 1, $$

i.e. sin x and x are equivalent infinitesimals, and therefore sin x − x is
an infinitesimal of higher order than x. We see later, that this difference is
equivalent to −x³/6, i.e. it is an infinitesimal of the third order with respect
to x.

2. We show that the difference 1 − cos x is an infinitesimal of the second
order with respect to x. We have in fact, on using a well-known trigonometric
formula and with a simple rearrangement,

$$ \frac{1 - \cos x}{x^2} = \frac{2 \sin^2 \frac{x}{2}}{x^2} = \frac{1}{2} \left( \frac{\sin \frac{x}{2}}{\frac{x}{2}} \right)^2. $$




If x → 0, α = x/2 also tends to 0, and as we have shown:

$$ \lim_{x \to 0} \frac{\sin \frac{x}{2}}{\frac{x}{2}} = \lim_{\alpha \to 0} \frac{\sin \alpha}{\alpha} = 1, $$

and hence,

$$ \lim_{x \to 0} \frac{1 - \cos x}{x^2} = \frac{1}{2}, $$

i.e. in fact, 1 − cos x is an infinitesimal of the second order with respect
to x.
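3. From the expression

$$ \sqrt{1 + x} - 1 = \frac{x}{\sqrt{1 + x} + 1} $$

we have:

$$ \frac{\sqrt{1 + x} - 1}{x} = \frac{1}{\sqrt{1 + x} + 1}, $$

whence

$$ \lim_{x \to 0} \frac{\sqrt{1 + x} - 1}{x} = \frac{1}{2}, $$

i.e. √(1 + x) − 1 and x are infinitesimals of the same order, √(1 + x) − 1
being equivalent to x/2.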


4. We show that a polynomial of degree m > 1 is an infinitely large magni-
tude of order m with respect to x, for x → ∞. In fact,

$$ \lim_{x \to \infty} \frac{a_0 x^m + a_1 x^{m-1} + \dots + a_{m-1} x + a_m}{x^m} = \lim_{x \to \infty} \left( a_0 + \frac{a_1}{x} + \dots + \frac{a_{m-1}}{x^{m-1}} + \frac{a_m}{x^m} \right) = a_0. $$

It can easily be seen that two polynomials of the same degree are infinitely
large magnitudes of the same order, for x → ∞. The limit of their ratio is the
ratio of the coefficients of their highest terms. For example:

$$ \lim_{x \to \infty} \frac{5x^2 + x - 3}{7x^2 + 2x + 4} = \lim_{x \to \infty} \frac{5 + \frac{1}{x} - \frac{3}{x^2}}{7 + \frac{2}{x} + \frac{4}{x^2}} = \frac{5}{7}. $$

If the two polynomials are of different degree, the one of higher degree is
an infinitely large magnitude of higher order with respect to the other, for
x → ∞.
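The orders found in examples 1 to 3 can be estimated numerically. The following is a rough sketch under stated assumptions: if β(x) behaves like c·x^k for small x, then the slope of log |β| against log x is close to k. The function name estimate_order and the two sample points are choices made here for illustration only.

```python
import math

def estimate_order(beta, x1=1e-2, x2=1e-3):
    """Estimate k such that beta(x) behaves like c * x**k as x -> 0,
    from the slope of log|beta| against log x between two small points."""
    rise = math.log(abs(beta(x1))) - math.log(abs(beta(x2)))
    run = math.log(x1) - math.log(x2)
    return rise / run

print(estimate_order(lambda x: math.sin(x) - x))        # example 1, near 3
print(estimate_order(lambda x: 1.0 - math.cos(x)))      # example 2, near 2
print(estimate_order(lambda x: math.sqrt(1.0 + x) - 1)) # example 3, near 1
```

The estimates are only approximate, since the higher-order remainder γ has not yet become negligible at the chosen sample points.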


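38. The number e. Our present example is important later on: we
consider the variable taking the values

$$ \left( 1 + \frac{1}{n} \right)^n, $$

where n tends to +∞, increasing through positive integers. Using
Newton's binomial formula, we obtain:

$$ \left( 1 + \frac{1}{n} \right)^n = 1 + \frac{n}{1} \cdot \frac{1}{n} + \frac{n(n-1)}{2!} \cdot \frac{1}{n^2} + \frac{n(n-1)(n-2)}{3!} \cdot \frac{1}{n^3} + \dots + \frac{n(n-1)(n-2) \dots (n-k+1)}{k!} \cdot \frac{1}{n^k} + \dots + \frac{n(n-1)(n-2) \dots 2 \cdot 1}{n!} \cdot \frac{1}{n^n} $$

$$ = 1 + 1 + \frac{1}{2!} \left( 1 - \frac{1}{n} \right) + \frac{1}{3!} \left( 1 - \frac{1}{n} \right) \left( 1 - \frac{2}{n} \right) + \dots + \frac{1}{k!} \left( 1 - \frac{1}{n} \right) \left( 1 - \frac{2}{n} \right) \dots \left( 1 - \frac{k-1}{n} \right) + \dots + \frac{1}{n!} \left( 1 - \frac{1}{n} \right) \left( 1 - \frac{2}{n} \right) \dots \left( 1 - \frac{n-1}{n} \right). $$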


The sum written above contains (n + 1) positive terms. As the
integer n increases, the number of terms increases and each term
itself also increases, since in the expression for the general term:

$$ \frac{1}{k!} \left( 1 - \frac{1}{n} \right) \left( 1 - \frac{2}{n} \right) \dots \left( 1 - \frac{k-1}{n} \right) $$

k! remains unchanged, whilst the differences in brackets increase
with increasing n. We thus see that the variable in question increases
with increasing n; so that it is sufficient to show that the variable
is bounded, in order to prove that its limit exists.

We replace all the differences appearing in the general term by
unity, and all the factors of k!, starting with 3, by 2. The general
term is evidently now increased, and we shall have, on using the
formula for the sum of the terms of a geometrical progression:

$$ \left( 1 + \frac{1}{n} \right)^n \le 1 + 1 + \frac{1}{2} + \frac{1}{2^2} + \dots + \frac{1}{2^{n-1}} = 1 + \frac{1 - \frac{1}{2^n}}{1 - \frac{1}{2}} < 3, $$

i.e. the variable (1 + 1/n)^n is bounded. We denote its limit by the
letter e:

$$ \lim_{n \to +\infty} \left( 1 + \frac{1}{n} \right)^n = e \quad (n \text{ is a positive integer}). \tag{23} $$

This limit is evidently not greater than 3.

† The product (1 − 1/n)(1 − 2/n) ... (1 − (k − 1)/n) is obtained from the
fraction n(n − 1)(n − 2) ... (n − k + 1)/n^k if, noting that there are k factors
n in the denominator, each of the k factors of the product on top is divided
by n.
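The two facts just proved, that the variable increases with n and stays below 3, are easy to observe numerically. This is a minimal sketch; the function name seq and the chosen values of n are illustrative, and floating-point arithmetic limits how large n can usefully be taken.

```python
import math

def seq(n):
    """The variable (1 + 1/n)^n for a positive integer n."""
    return (1.0 + 1.0 / n) ** n

ns = (1, 2, 5, 10, 100, 1000, 10_000)
values = [seq(n) for n in ns]
for n, v in zip(ns, values):
    print(n, v)

# For comparison, the library value of e.
print(math.e)
```

The approach to the limit is slow: the difference from e is roughly of the order of 1/n, which is why the text promises a better method of calculation later.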
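We now show that the expression (1 + 1/x)^x tends to the same
limit e, if x tends to +∞, taking any values.
Let n be the greatest integer included in x, i.e.

$$ n \le x < n + 1. $$

The number n evidently tends to +∞ along with x. On noting
that a power term increases, both with increase of the positive base,
greater than unity, and with increase of the exponent of the
power, we can write:

$$ \left( 1 + \frac{1}{n+1} \right)^n < \left( 1 + \frac{1}{x} \right)^x < \left( 1 + \frac{1}{n} \right)^{n+1}. \tag{24} $$

But by equation (23):

$$ \lim_{n \to +\infty} \left( 1 + \frac{1}{n} \right)^{n+1} = \lim_{n \to +\infty} \left[ \left( 1 + \frac{1}{n} \right)^n \left( 1 + \frac{1}{n} \right) \right] = e \cdot 1 = e $$

and

$$ \lim_{n \to +\infty} \left( 1 + \frac{1}{n+1} \right)^n = \lim_{n \to +\infty} \frac{\left( 1 + \frac{1}{n+1} \right)^{n+1}}{1 + \frac{1}{n+1}} = \frac{e}{1} = e. $$

Thus, the extreme terms of inequality (24) tend to the limit e,
and hence the middle term must tend to the same limit, i.e.

$$ \lim_{x \to +\infty} \left( 1 + \frac{1}{x} \right)^x = e. \tag{25} $$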


We now consider the case when x tends to −∞.
We introduce a new variable y in place of x, putting

$$ x = -1 - y, $$

whence

$$ y = -1 - x. $$

It is evident from the last equation that y tends to +∞ as x tends
to −∞.

On changing the variables in the expression (1 + 1/x)^x and noting
equation (25), we obtain:

$$ \lim_{x \to -\infty} \left( 1 + \frac{1}{x} \right)^x = \lim_{y \to +\infty} \left( \frac{-y}{-1-y} \right)^{-1-y} = \lim_{y \to +\infty} \left( \frac{1+y}{y} \right)^{1+y} = \lim_{y \to +\infty} \left[ \left( 1 + \frac{1}{y} \right)^y \left( 1 + \frac{1}{y} \right) \right] = e. $$

If x tends to ∞, with either sign, i.e. | x | → +∞, it follows from
the above that here also:

$$ \lim_{x \to \infty} \left( 1 + \frac{1}{x} \right)^x = e. \tag{26} $$

We shall later give a suitable method for calculating e to any
degree of accuracy. It is an irrational number; we have,
to an accuracy of seven decimal places: e = 2.7182818...

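We can now easily find the limit of (1 + k/x)^x, where k is a given
number. Using the continuity of a power function, we obtain:

$$ \lim_{x \to \infty} \left( 1 + \frac{k}{x} \right)^x = \lim_{y \to \infty} \left( 1 + \frac{1}{y} \right)^{ky} = \lim_{y \to \infty} \left[ \left( 1 + \frac{1}{y} \right)^y \right]^k = e^k, $$

where y denotes x/k, and tends to infinity along with x.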

An expression of the form (1 + k/n)^n is encountered in compound
interest theory.

We suppose that an increment of capital occurs annually. If capital a
returns an interest annually of p per cent, the accumulated capital
in the course of a year will be:

$$ a(1 + k), $$

where

$$ k = \frac{p}{100}; $$

after another year has elapsed, it will be:

$$ a(1 + k)^2; $$

and in general, after the lapse of m years, it will be:

$$ a(1 + k)^m. $$


We now suppose that the increment of capital takes place every
1/n of a year. The number k is now diminished n times, since the
percentage interest is counted over a year, whilst the number of
intervals of time is increased n times; so that the accumulated capital
over m years will be:

$$ a \left( 1 + \frac{k}{n} \right)^{mn}. $$

Finally, let n tend to infinity, i.e. an increment of capital occurs
in every smallest possible interval of time, and in the limit, con-
tinuously. After the lapse of m years, the accumulated capital will be:

$$ \lim_{n \to \infty} a \left( 1 + \frac{k}{n} \right)^{mn} = \lim_{n \to \infty} a \left[ \left( 1 + \frac{k}{n} \right)^n \right]^m = a e^{km}. $$
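The passage from periodic to continuous compounding can be followed numerically. The sketch below is illustrative only: the function names compounded and continuous, and the sample figures (capital 100, 5 per cent, 10 years), are assumptions made for the example and do not come from the text.

```python
import math

def compounded(a, k, m, n):
    """Capital after m years at annual rate k, compounded n times a year:
    a * (1 + k/n)^(m n)."""
    return a * (1.0 + k / n) ** (m * n)

def continuous(a, k, m):
    """The limiting value a * e^(k m) for continuous compounding."""
    return a * math.exp(k * m)

a, k, m = 100.0, 0.05, 10          # sample figures: p = 5 per cent
for n in (1, 12, 365, 10**6):
    print(n, compounded(a, k, m, n))
print("limit", continuous(a, k, m))
```

More frequent compounding raises the accumulated capital, but only up to the finite bound given by the exponential formula.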

The number e is used as a base of logarithms. These are referred 
to as natural logarithms and are here denoted by the simple sign 
log without indicating the base. 

For x tending to zero, both numerator and denominator in the
expression log (1 + x)/x tend to zero. Let us examine this indeter-
minate form. We introduce a new variable y, putting

$$ x = \frac{1}{y}, $$

whence evidently, as x → 0, y tends to infinity. Substituting the
new variable, and making use of the continuity of a logarithm and
formula (26), we obtain:

$$ \lim_{x \to 0} \frac{\log (1 + x)}{x} = \lim_{y \to \infty} y \log \left( 1 + \frac{1}{y} \right) = \lim_{y \to \infty} \log \left( 1 + \frac{1}{y} \right)^y = \log e = 1. $$
The advantage of the present choice of a base of logarithms is
clear from this. Just as, using radian measure of angles, the true
value of (sin x)/x is unity for x = 0, in the case of natural logarithms the
true value of log (1 + x)/x is also unity for x = 0.

The following relationship follows from the definition of logarithm:

$$ N = a^{\log_a N}. $$

Taking logarithms to base e in this equation, we obtain:

$$ \log N = \log_a N \cdot \log a \quad \text{or} \quad \log_a N = \log N \cdot \frac{1}{\log a}. $$

This relationship gives the logarithm of a number N to any base a
in terms of its natural logarithm. The factor M = 1/log a is called
the modulus of the system of logarithms to base a, and for a = 10
it is given with an accuracy of seven decimal places by:


M = 0.4342945 ... 
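The change-of-base relationship and the value of the modulus can be checked with the standard library. This is a brief sketch; the names modulus and log_to_base are illustrative, while math.log (natural logarithm) and math.log10 are the standard Python functions.

```python
import math

def modulus(a):
    """Modulus M = 1 / log a of the system of logarithms to base a."""
    return 1.0 / math.log(a)

def log_to_base(N, a):
    """log_a N obtained from the natural logarithm: log N * M."""
    return math.log(N) * modulus(a)

print(modulus(10.0))                          # compare with M quoted in the text
print(log_to_base(2.0, 10.0), math.log10(2.0))
```

Multiplying a natural logarithm by M converts it to a common logarithm, which is how tables to base 10 relate to natural logarithms.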


39. Unproved hypotheses. We left several hypotheses unproved, in 
dealing with the theory of limits, and we now give them again: the 
existence of a limit for a monotonic bounded variable [30], the
necessary and sufficient condition for the existence of a limit (Cauchy's 
test) [31] and the three properties of a function continuous in a 
closed interval [35]. The proofs of these hypotheses are based on the 
theory of real numbers and the operations on them. The following 
paragraphs will deal with this theory, and with the proofs of the 
above hypotheses. 

We introduce another new concept, and formulate a further hypo- 
thesis, the proof of which will also be given below. If we have a set, 
consisting of a finite number of real numbers (for example, we might 
have a thousand real numbers), there will be both a maximum and 
a minimum amongst them. On the other hand, if we have an infinite
set of real numbers, such that, furthermore, all the numbers belong
to a definite interval, there will not always be a maximum and minimum
amongst them. For example, if we take the set of all real numbers,
lying between 0 and 1, but at the same time exclude from this set
the numbers 0 and 1 themselves, there will be neither a maximum
nor a minimum in the set. Whatever number we take, near to but less
than, unity, we can always find a second number, lying between
this first number and unity. The numbers 0 and 1 in this case, whilst
not belonging to our set of numbers, have the following property
in relation to it: there is no number greater than unity among the
set, but for any given positive number ε there exists a number greater
than (1 − ε). Similarly, there is no number less than zero among
the set, but for any given positive number ε there exists a number
less than (0 + ε). The numbers 0 and 1 are called the strict lower
and strict upper bounds of the set of real numbers in question.
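The defining property of the strict upper bound 1 in this example can be made concrete: for every positive ε one can exhibit a member of the set exceeding 1 − ε. The sketch uses exact rational arithmetic to avoid rounding; the function name member_above and the particular choice of member are assumptions made for illustration.

```python
from fractions import Fraction

def member_above(eps):
    """Return a number strictly between 0 and 1 that exceeds 1 - eps,
    for a positive rational eps."""
    eps = Fraction(eps)
    if eps <= 0:
        raise ValueError("eps must be positive")
    # 1 - eps/2 works for small eps; 1/2 covers the case of large eps.
    return max(1 - eps / 2, Fraction(1, 2))

for eps in ("1/10", "1/1000000", "3"):
    print(eps, member_above(eps))
```

No single member works for every ε, which is exactly why the set has a strict upper bound but no maximum.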

We pass from this example to the general case. Let E be some
set of real numbers. We say that it is bounded above, if there exists
a number M, such that all numbers belonging to E are not greater
than M. Similarly, we say that the set is bounded below, if there
exists a number m, such that all numbers belonging to E are not
less than m. If a set is bounded above and below, we simply say
that it is bounded.

DEFINITION. The strict upper bound of a set E is defined as the
number β (if it exists), such that no member of E is greater than β,
whereas, for any given positive ε, there is a member greater than (β − ε).
The strict lower bound of the set is defined as the number α (if it exists),
such that no member of E is less than α, whereas for any given positive
ε there is a member less than (α + ε).

If the set E is not bounded above, i.e. if there exists a member
of E greater than any given number, the set cannot have a strict 
upper bound. Similarly, if E is not bounded below, it cannot have 
a strict lower bound. If there exists a maximum among the numbers 
of the set, this is evidently the strict upper bound of the set. Similarly, 
if there is a minimum among the numbers of the set, this is the strict 
lower bound of set E. But as we have seen, there is not always a 
maximum or minimum among the members of an infinite set. It can 
be shown, however, that there is always a strict upper bound for a 
set bounded above, and always a strict lower bound for a set bounded 
below. We also note a direct consequence of the definition of strict 
bounds, that the strict upper and strict lower bounds must be unique. 

Later on, we shall often make use of the hypotheses indicated in 
the present paragraph. The next paragraph, in small print, can be 
omitted at a first reading. 


40. Real numbers. We begin by dealing with the theory of real numbers. 
We start out from the set of all rational numbers, integral and fractional, 
positive and negative. All these rational numbers can be arranged in increas-
ing order. If this is done, and a and b are any two distinct rational numbers,
any desired number of rational numbers can be found between them. Let
a < b, and let us introduce the positive rational number r = (b − a)/n,
where n is any positive integer. The rational numbers a + r, a + 2r, a + 3r, 
... a + (n — 1)r lie between a and b, and in view of the arbitrariness of 
choice of the integer n, our statement is proved. 

We define a section in the domain of rational numbers as any division of all
rational numbers into two classes, such that any number of one (the first) 
class is less than any number of the other (second) class. Evidently, in this 
case, if a given number belongs to the first class, every number less than 
it also belongs to the first class, and if a given number belongs to the second 
class, every number greater than it also belongs to the second class. 

We suppose that there is a greatest among the numbers of the first class. 
In this case, by the property mentioned of the set of rational numbers, 
it can be asserted that there is no minimum among the numbers of the second 
class. Similarly, if there is a least among the numbers of the second class,
there is no greatest among the numbers of the first class. We call the section
a section of the first kind, if there is a greatest among the numbers of the 
first class, or if there is a least among numbers of the second class. It is easy 
to construct such a section. We take any rational number b and place all 
rational numbers less than b in the first class, and all rational numbers 
greater than b in the second class, whilst b itself is placed either in the first 
class (where it will be the greatest) or in the second class (where it will be the 
least). Taking every possible rational number as b, we obtain every possible 
section of the first kind. We shall say that such a section of the first kind 
defines the rational number b, this being the greatest in the first, or the least
in the second, class. 

But there exists a section of the second kind, where there is no greatest 
in the first, or least in the second, class. We construct one such section, as 
an example. We put in the first class all negative rational numbers, zero, 
and those positive rational numbers whose square is less than two, whilst 
we put in the second class all those positive rational numbers whose square 
is greater than two. Since there is no rational number whose square is equal 
to two, all the rational numbers are now assigned, and we have a certain sec- 
tion. We shall show that there is no greatest number in the first class. To 
do this, it is sufficient to show that, if a belongs to the first class, there 
exists a number greater than a, also belonging to the first class. This is evident 
if a is negative or zero; so we suppose a > 0. Since it belongs to the first class,
a² < 2. We bring in a positive rational number r = 2 − a², and we show
that a positive rational number x can be defined, small enough for (a + x)
also to belong to the first class, i.e. for us to have the inequality:

$$ 2 - (a + x)^2 > 0 \quad \text{or} \quad r - 2ax - x^2 > 0, $$

i.e. it amounts to us finding a positive rational number x which satisfies the
inequality:

$$ x^2 + 2ax < r. $$

Taking x < 1, we have x² < x, and hence, x² + 2ax < x + 2ax = (2a + 1)x,
i.e. it is sufficient for us to satisfy the inequality:

$$ (2a + 1)x < r, $$

so that x is defined by the two inequalities:

$$ x < 1 \quad \text{and} \quad x < \frac{r}{2a + 1}. $$

We can evidently find as many positive rational numbers x as desired,
satisfying both these inequalities. It can be shown in precisely the same way,
that there is no least number in the second class of our section. We have thus
constructed an example of a section of the second kind. The following hypo-
thesis is a turning-point in the theory: we suppose that every section of the
second kind defines a certain new entity, i.e. an irrational number. Different
sections of the second kind define different irrational numbers. In the above
example of a section of the second kind, it is easy to surmise that the irrational
number defined is that usually denoted by √2.

We can now arrange all the irrational numbers thus introduced, together with
the previous rational numbers, in increasing order, as is done intuitively with
the points of the directed axis OX. If α is a certain irrational number,
we denote the first and second classes of the section which defines it by
I(α) and II(α) respectively. We reckon α as greater than any number of
I(α), and less than any number of II(α). Any irrational can thus be compared
with any rational number. It remains to define the concepts of greater than
and less than for any two distinct irrational numbers α and β. Since α and
β are distinct, classes I(α) and I(β) do not coincide, and one class is contained
in the other. We suppose that I(α) is contained in I(β), i.e. every number
of I(α) belongs to I(β), whilst there is a number of I(β) belonging to II(α).
We take it that here, by definition, α < β. In this way, the set of all
rational and irrational numbers, or in other words, the set of all real
numbers is arranged in order. Using the definitions given above, it is now
easy to show that, if a, b, c are real numbers, and a < b, b < c, then
a < c.
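The argument given earlier in this paragraph, that the first class of the section defining √2 has no greatest member, is constructive and can be carried out in exact rational arithmetic. The sketch below is an illustration under stated assumptions: the function name larger_in_first_class is hypothetical, and halving the bound is merely one convenient way of choosing x below both bounds.

```python
from fractions import Fraction

def larger_in_first_class(a):
    """Given a positive rational a with a*a < 2, return a rational a + x,
    with x > 0, whose square is still less than 2."""
    a = Fraction(a)
    if not (a > 0 and a * a < 2):
        raise ValueError("need a > 0 and a^2 < 2")
    r = 2 - a * a
    # x must satisfy x < 1 and x < r / (2a + 1), as in the text.
    bound = min(Fraction(1), r / (2 * a + 1))
    x = bound / 2                  # any positive rational below the bound
    return a + x

a = Fraction(1)
for _ in range(5):
    a = larger_in_first_class(a)
    print(a, float(a), a * a < 2)
```

Each step produces a strictly larger rational that still lies in the first class, so no member of that class can be the greatest.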

We note above all one elementary consequence of the definitions given.
Let α be a certain irrational number. Since there is no greatest number in
class I(α), and no least number in class II(α), it is immediately evident that
any desired number of rational numbers may be set between α and any
given rational number a. Now let α < β be two distinct irrational numbers.
Part of the rational numbers of I(β) enter into II(α), and hence it immediately
follows that it is also possible to place any desired number of rational numbers
between α and β, i.e. in general, any desired number of rational numbers can
be placed between two distinct real numbers.

We now pass to the proof of the basic theorem of the theory of irrational 
numbers. We take the aggregate of all real numbers and make some section 
of it, i.e. we assign all real numbers, both rational and irrational, to two 
classes I and II, such that any number of I is less than any number of II. 
We show that there must now be either a greatest number in class I, or a 
least number in class II (one excludes the other, as above, for a section of 
the domain of rational numbers). For this, we denote by I' the set 
of all rational numbers of I, and by II’, the set of all rational numbers 
of II. Classes I' and II' define a certain section of the domain of rational
numbers, and this section defines a real number α (rational or irrational).
We suppose for clarity that this α lies in class I, with the above assignment
of all real numbers into two classes. We show that α must be the greatest
number of class I. If this were untrue, there would exist a real number
β of class I, greater than α. We take some rational number r, lying between
α and β, i.e. α < r < β. It must belong to class I, and hence, to class I'.

Thus, the number r, greater than α, is in the first class of the section
(I', II') defining α. But this is impossible; and hence our assertion that α
is the greatest number of class I must be true. It can similarly be shown,
that if α belongs to class II, it must be the least number there.

We have thus proved the following basic theorem: 

FUNDAMENTAL THEOREM. For any section, made in the domain of real
numbers, either the first class must contain a greatest number, or the second class
a least number.

A simple geometrical meaning can easily be attached to all the arguments
of the present paragraph. To start with, we take on the axis OX only the points 
with rational abscissae. A section in the domain of rational numbers cor- 
responds to cutting OX into two semi-axes. If the cutting is at a point with 
a rational abscissa, the section obtained is of the first kind, the abscissa 
of the point of cutting being itself counted as either in the first, or in the second, 
class. If the cutting occurs at a point for which there is no corresponding 
rational abscissa, a section of the second kind is obtained, defining an irrational 
number which is taken to be the abscissa of the point of cutting. After fil- 
ling in such empty points with irrational abscissae, every division of the axis 
now occurs at a point with a certain real abscissa. All this is mere geometrical 
illustration, and lacks the force of a proof. Using the definition given of an 
irrational number α, it is easy to form the infinite decimal fraction
corresponding to this number [2]. Every finite piece of this fraction must belong to
I(α), but if the last figure of this piece is increased by unity, the resultant
rational number must belong to II(α).
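
The formation of the decimal fraction from a section can be sketched in a few
lines of Python. This is only an illustration under stated assumptions: the
section is taken to be the one defining √2, its first class is represented by
the hypothetical test in_first_class, and the number defined is assumed positive.
Each digit is raised as long as the truncated decimal remains in class I.

```python
from fractions import Fraction


def in_first_class(q: Fraction) -> bool:
    """Class I of the section assumed here: negatives and rationals with q*q < 2."""
    return q < 0 or q * q < 2


def decimal_digits(in_lower, n_digits: int) -> str:
    """Decimal expansion of the positive number defined by a section.

    in_lower(q) must say whether the rational q lies in class I.
    """
    # Integer part: the largest integer still lying in class I.
    whole = 0
    while in_lower(Fraction(whole + 1)):
        whole += 1
    value = Fraction(whole)
    digits = []
    for k in range(1, n_digits + 1):
        step = Fraction(1, 10 ** k)
        d = 0
        # Raise the k-th digit while the truncated decimal stays in class I.
        while d < 9 and in_lower(value + (d + 1) * step):
            d += 1
        value += d * step
        digits.append(str(d))
    return f"{whole}." + "".join(digits)


print(decimal_digits(in_first_class, 10))  # 1.4142135623
```

Exact rational arithmetic is used so that membership in the two classes is
never decided by a rounded value.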


41. The operations on real numbers. The theory of irrational numbers 
contains, in addition to the definitions and the basic theorem given above, 
definitions of the operations on irrational numbers, and a study of the pro- 
perties of these operations. We shall define the operations by making use 
of sections in the domain of rational numbers; moreover, since these sections 
define not only irrational, but also rational numbers (sections of the first 
kind), the definitions of the operations will suit all real numbers in general, and 
will coincide with what is known in the case of rational numbers. Our dis- 
cussion is limited to general remarks in the present section. 

We make a preliminary lemma. Let α be a certain real number. We take
some (small) positive rational number r, then a rational number a of I(α),
and form the arithmetic progression:

$$a,\ a + r,\ a + 2r,\ \ldots,\ a + nr,\ \ldots$$

For n large, (a + nr) belongs to II(α); hence, there will exist a positive
integer k, such that [a + (k − 1)r] lies in I(α), and (a + kr) lies in II(α),
i.e.:

LEMMA. For any given section of the rational numbers, there exist numbers 
in the two classes differing by any given positive rational number r, however 
small. 
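
The argument of the lemma is constructive, and may be followed step by step on
a machine. The sketch below is an illustration only: the section is assumed to
be that of √2, and the names in_first_class and straddling_pair are
hypothetical. The progression a, a + r, a + 2r, ... is walked until it first
leaves class I.

```python
from fractions import Fraction


def in_first_class(q: Fraction) -> bool:
    """Class I of the section assumed here (the one defining the square root of 2)."""
    return q < 0 or q * q < 2


def straddling_pair(in_lower, start: Fraction, r: Fraction):
    """Walk start, start + r, start + 2r, ... until the section is crossed.

    Returns (a + (k - 1) r, a + k r): the first lies in class I, the second
    in class II, and they differ by exactly r.
    """
    if r <= 0 or not in_lower(start):
        raise ValueError("need r > 0 and a starting number in class I")
    k = 1
    while in_lower(start + k * r):
        k += 1
    return start + (k - 1) * r, start + k * r


low, high = straddling_pair(in_first_class, Fraction(1), Fraction(1, 1000))
print(low, high)  # 707/500 283/200, i.e. 1.414 and 1.415
```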

We now pass to defining addition. Let α and β be two real numbers. Let
a be any number of I(α), a' any number of II(α), b any number of I(β),
b' any number of II(β). We form all the possible sums (a + b) and (a' + b').
In every case we have: a + b < a' + b'. We carry out a new section of the
rational numbers, putting all rationals greater than all (a + b) in the second
class, and all remaining rationals in the first class. Any number of the first
class is now less than any number of the second class, all the numbers (a + b)
falling in the first class, and all (a' + b') in the second class. This new section
defines a certain real number, which we call the sum (α + β). This number
is evidently greater than or equal to all (a + b), and less than or equal to
all (a' + b'). Noting that, by the above lemma, numbers a and a', as also
b and b', can differ from each other by any given small positive rational
number, it is easy to see that there can exist only one number, satisfying the
above-mentioned inequality. We can at once verify that addition satisfies
the ordinary laws, known for rational numbers:

$$\alpha + \beta = \beta + \alpha; \quad (\alpha + \beta) + \gamma = \alpha + (\beta + \gamma); \quad \alpha + 0 = \alpha.$$

For example, to obtain (β + α), we should have to construct the sums
(b + a) and (b' + a') instead of (a + b) and (a' + b'), in which case the
former sums coincide with the latter, since addition of rational numbers
is known to obey the rule of transposition.
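
The way in which the sums (a + b) and (a' + b') close in on the sum of two
real numbers can be illustrated numerically. The following is a sketch under
assumptions of our own choosing: the two numbers are taken to be √2 and √3,
each given by the first class of its section, and the helper names are
hypothetical. The pairs a, a' and b, b' are chosen to differ by r, so that the
two bounds differ by 2r.

```python
from fractions import Fraction


def lower_class_of_sqrt(n: int):
    """Class-I membership test of the section assumed to define sqrt(n)."""
    return lambda q: q < 0 or q * q < n


def straddle(in_lower, r: Fraction):
    """Return (a, a') with a in class I, a' in class II and a' - a = r.

    Assumes the number defined by the section is positive, so that 0 is in class I.
    """
    k = 0
    while in_lower(k * r):
        k += 1
    return (k - 1) * r, k * r


def sum_bounds(in_lower_alpha, in_lower_beta, r: Fraction):
    """Bracket the sum between one sum a + b and one sum a' + b'."""
    a, a_up = straddle(in_lower_alpha, r)
    b, b_up = straddle(in_lower_beta, r)
    return a + b, a_up + b_up


low, high = sum_bounds(lower_class_of_sqrt(2), lower_class_of_sqrt(3), Fraction(1, 100))
print(low, high)  # 157/50 79/25, i.e. 3.14 and 3.16
```

Taking r smaller narrows the bracket as far as desired, which is the reason
why only one number can satisfy the inequalities.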

Let α be a certain real number. We define the number (−α) by the section
carried out as follows: we put in the first class all the rational numbers of
II(α) with changed sign, and in the second class, all numbers of I(α) with
changed sign. This is in fact a section in the domain of rational numbers;
and we have for (−α), as is easily verified:

$$-(-\alpha) = \alpha; \quad \alpha + (-\alpha) = 0.$$

Clearly, if α < 0, (−α) > 0, and conversely. Provided α ≠ 0, we say that
whichever of the numbers α and (−α) is greater than zero is its absolute
value. As before, |α| denotes the absolute value of α.

We now pass to multiplication. Let α and β be two positive real numbers,
i.e. α > 0 and β > 0. Let a be any positive number of I(α), b be any positive
number of I(β), and a', b' any numbers of II(α) and II(β) (they must also be
positive). We construct a new section, putting all rational numbers greater
than all products ab in the second class, and all the remaining rational
numbers in the first class. All ab fall in the first class, and all a'b' in the second
class. The new section defines a certain real number, which we call the
product αβ. This number is greater than or equal to all ab and not greater than
all a'b', and there is only one real number satisfying these inequalities.

In the case of negative α and/or β, we carry out multiplication as in the
previous case, whilst introducing into the definition of multiplication the
usual rule of signs, i.e. we put αβ = ± |α| |β|, taking the (+) sign if both
α and β are less than zero, and the (−) sign if one is greater than zero, and
the other less than zero.

We take as the definition of multiplication by zero that α · 0 = 0 · α = 0.
The basic rules of multiplication are at once verified:

$$\alpha\beta = \beta\alpha; \quad (\alpha\beta)\gamma = \alpha(\beta\gamma); \quad \alpha(\beta + \gamma) = \alpha\beta + \alpha\gamma,$$

and the product of a given number of factors can be zero if, and only if,
at least one of these factors is zero.

Subtraction is defined as the converse operation to addition, i.e. α − β = x
is equivalent to x + β = α. Adding (−β) to both sides of this equation, we
obtain by the properties of addition given above: x = α + (−β), i.e. the
difference has to be defined in accordance with this formula, so that the
operation of subtraction leads to addition. It remains to verify that the expression
obtained for x actually satisfies the condition x + β = α, but this immediately
follows from the properties of addition. We shall see how the usual property
is justified, that the inequality α > β is equivalent to α − β > 0. Before
this, we turn to division, defining the reciprocal of a given number. If a is a
rational number, differing from zero, the number 1/a is called the
reciprocal. Let α be a real number, not zero. Let α > 0 at first, and let
a' be any number of II(α) (it is rational and positive). We define the reciprocal
of α by the following section: we put all negative numbers, zero, and all
numbers 1/a' in the first class, and the remaining numbers in the second class.
Let a certain positive number c_1 belong to the first class of the new section.
This means that c_1 = 1/a_1', where a_1' is in II(α). We take any positive rational
number c_2 < c_1. It can be written as c_2 = 1/a_2', where a_2' is rational and
a_1' < a_2', i.e. a_2' also belongs to II(α). In other words, if a certain positive number
belongs to the first class of the new section, every smaller positive rational
number also belongs to the first class. Also, all negative numbers and zero
belong there, by hypothesis. Hence it is clear, that in forming the section
that defines the reciprocal of α, we maintain the basic hypothesis that any
number of the second class is greater than any number of the first class. This
number, the reciprocal of α, is denoted by the symbol 1/α.
If α < 0, we define the reciprocal by the formula:

$$\frac{1}{\alpha} = -\frac{1}{|\alpha|}.$$

Using the definition of multiplication, we obtain:

$$\alpha \cdot \frac{1}{\alpha} = 1.$$

We now turn to division. This is the inverse operation to multiplication,
i.e. α : β = x is equivalent to xβ = α, and, as in subtraction, it is easy to
see that, if β ≠ 0, the quotient obtained, x = α · 1/β, is unique; in this way,
division leads to multiplication. Division by zero is impossible.

A number is raised to a positive integral power by multiplication.
Extracting a root is defined as the inverse operation to raising to a power. Let α
be a real positive number, and n a given integer, greater than unity. We form
the following section of the rational numbers: we put in the first class all
negative numbers, zero, and all positive numbers whose nth power is less
than α, and all remaining numbers go into the second class. Using the
definition of multiplication, it is easily seen that the positive number β, defined
by this section, satisfies the condition: β^n = α, i.e. β is the arithmetic value
of the nth root of α. If n is even, there will be a second value (−β). The root of odd
degree of a real negative number is analogously defined (there is a unique
answer). Exponential functions will be discussed in full detail later. We now
note the following important result: having once justified the basic laws of
operations, all the rules and identities of algebra are simultaneously justified,
if letters are understood to represent real numbers.
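
The section defining the root lends itself to a simple computation. The sketch
below is an illustration, not part of the theory: it assumes a rational α for
the sake of exact arithmetic, uses repeated halving (a technique not mentioned
in the text), and the name nth_root_bounds is hypothetical. A rational is
assigned to the first class exactly when its nth power is less than α.

```python
from fractions import Fraction


def nth_root_bounds(alpha: Fraction, n: int, tol: Fraction):
    """Return rationals (lo, hi) with lo**n < alpha <= hi**n and hi - lo <= tol.

    lo always lies in the first class of the section, hi in the second.
    """
    if alpha <= 0 or n < 2 or tol <= 0:
        raise ValueError("need alpha > 0, n >= 2 and tol > 0")
    lo = Fraction(0)                     # 0**n = 0 < alpha
    hi = max(Fraction(1), alpha)         # hi**n >= alpha in either case
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if mid ** n < alpha:             # mid belongs to the first class
            lo = mid
        else:                            # mid belongs to the second class
            hi = mid
    return lo, hi


lo, hi = nth_root_bounds(Fraction(2), 3, Fraction(1, 10 ** 6))
print(float(lo), float(hi))
```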


42. The strict bounds of numerical sets. Tests for the existence of a limit.
We now prove the theorem regarding the strict bounds of a set of real
numbers, formulated in [39].

THEOREM. If a set E of real numbers is bounded above, it has a strict upper
bound, and if E is bounded below, it has a strict lower bound.

We restrict the proof to the first part of the theorem. By hypothesis, all
numbers of E are less than a certain number M. We form a section of the real
numbers as follows: we put all numbers greater than all numbers of E in
the second class, and the remaining real numbers in the first class. We have in
the second class, for example, all numbers (M + p), where p > 0, and in the first
class, for example, all numbers of E. Let β be the real number defined by
the section made. By the basic theorem of [40], it will be the greatest in
the first class, or the least in the second class. We show that β is in fact the
strict upper bound of E. Firstly, there is no number among E greater than
β, since all numbers of E are in the first class. Further, there certainly exists
a number of E, greater than (β − ε) for any ε > 0, since, if there were no
such number, (β − ε/2) would be greater than all numbers of E and would
have to belong to the second class, whereas it is actually less than β and is
in the first class. The theorem is thus proved. Clearly, if β belongs to E,
it will be the greatest of the numbers of E.

We now prove the existence of a limit for a monotonic bounded variable 
[30]. Let the variable x be continually increasing, or at least, not decreasing, 
i.e. every one of its values is not less than any previous value. Furthermore,
let x be bounded, i.e. there exists a number M, such that all values of x are less
than M. We consider the set of all values of x. By the theorem just proved,
there exists a strict upper bound β of this set. We show that β is the limit
of x. Let ε be an arbitrary positive number. By the definition of strict upper
bound, there is a value of x greater than (β − ε). Then, since x is monotonic,
all subsequent values of x are greater than (β − ε), whilst on the other hand,
they cannot be greater than β; and since ε is arbitrary, it follows that
β = lim x. The case of a decreasing variable can be worked out in precisely
the same way. 
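
A finite computation cannot replace the proof, but it may help to see the
hypotheses at work. As an illustration of our own choosing (the sequence is
not taken from the present section), the values (1 + 1/n)^n are computed
exactly below; over the range examined they increase and remain below 3, which
is the situation to which the theorem applies.

```python
from fractions import Fraction


def x(n: int) -> Fraction:
    """The n-th value of the illustrative variable (1 + 1/n)**n, computed exactly."""
    return (1 + Fraction(1, n)) ** n


values = [x(n) for n in range(1, 200)]

# Each value exceeds the previous one, and all stay below the bound M = 3.
is_increasing = all(u < v for u, v in zip(values, values[1:]))
is_bounded = all(v < 3 for v in values)

print(is_increasing, is_bounded, float(values[-1]))
```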

We prove a preliminary theorem, before passing to the proof of Cauchy’s 
test [31]. 

THEOREM. Given a sequence of finite intervals:

$$(a_1, b_1),\ (a_2, b_2),\ \ldots,\ (a_n, b_n),\ \ldots$$

where each successive interval is contained in the previous one, i.e. a_{n+1} ≥ a_n
and b_{n+1} ≤ b_n, and given also that the lengths of these intervals tend to zero,
i.e. (b_n − a_n) → 0, then the ends of the intervals, a_n and b_n, tend to a common
limit, with increasing n.

We have by hypothesis: a_1 ≤ a_2 ≤ ..., with also a_n < b_1 for any value of n.
The sequence a_1, a_2, ... is thus monotonic and bounded, and hence has a
limit: a_n → a. Since (b_n − a_n) → 0 by hypothesis, we have: b_n = a_n + ε_n,
where ε_n → 0, and hence b_n also has a limit, equal to a.

We now turn to the proof of Cauchy's test. We confine ourselves to the
case of an enumerated variable:

$$x_1,\ x_2,\ \ldots,\ x_n,\ \ldots \tag{27}$$

We have to show, that the necessary and sufficient condition for the existence
of a limit of sequence (27) is as follows: for any given positive ε there exists
a subscript N, such that

$$|x_m - x_n| < \varepsilon \quad \text{for } m \text{ and } n > N. \tag{28}$$

We show that the condition is sufficient, i.e. if it is satisfied, sequence (27) 
has a limit. It follows from our previous discussions [31], that if the condi- 
tion is satisfied, we can construct a sequence of intervals: 


$$(a_1, b_1),\ (a_2, b_2),\ \ldots,\ (a_k, b_k),\ \ldots$$

with the following properties: each successive interval is contained in the
previous one, the length (b_k − a_k) tends to zero, and for every (a_k, b_k) there
is a corresponding positive integer N_k, such that all x_s for s > N_k lie in
(a_k, b_k). These (a_k, b_k) are the intervals constructed in [31]. By the theorem
above, we have a common limit:

$$\lim_{k \to \infty} a_k = \lim_{k \to \infty} b_k = a. \tag{29}$$

We show that a is the limit of sequence (27). Let ε be a given positive
number. By (29), there exists a positive integer l, such that (a_l, b_l) and all
subsequent intervals lie inside the interval (a − ε, a + ε).

Hence it follows that all the numbers x_s for s > N_l also belong to this
interval, i.e. |a − x_s| < ε for s > N_l. Since ε is arbitrary, we see that a
is the limit of sequence (27), and the sufficiency of condition (28) is proved.
The necessity of the condition was proved earlier [31]. The proof remains 
valid for a non-enumerated variable. 
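
Condition (28) can be observed numerically on two sequences of our own
choosing, which are not taken from the text: the partial sums of 1/k^2, which
have a limit, and the partial sums of 1/k, which have none. The sketch below
measures, over a finite stretch of terms only, the largest difference
|x_m − x_n| for m and n > N; the helper name tail_spread is hypothetical, and
a finite check of this kind is an illustration, not a proof.

```python
from itertools import accumulate


def tail_spread(seq, N):
    """Largest |x_m - x_n| among the stored terms with index m, n > N.

    seq[0] holds x_1, so the terms with index greater than N are seq[N:].
    """
    tail = seq[N:]
    return max(tail) - min(tail)


TERMS = 2000
inverse_squares = list(accumulate(1 / k ** 2 for k in range(1, TERMS + 1)))
harmonic = list(accumulate(1 / k for k in range(1, TERMS + 1)))

for N in (10, 100, 1000):
    print(N, tail_spread(inverse_squares, N), tail_spread(harmonic, N))
```

For the first sequence the spread falls with increasing N; for the second it
remains of the order of unity over this stretch, since the block of terms from
N to 2N always contributes more than one half.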


43. Properties of continuous functions. Turning to the properties of con- 
tinuous functions that were given earlier [35], we first prove an auxiliary 
theorem. 

THEOREM 1. Given f(x) continuous in an interval (a, b), and ε any positive
number, the interval can be subdivided into a finite number of new intervals
in such a way that |f(x_2) − f(x_1)| < ε, provided x_1 and x_2 belong to the same
new interval.

We shall prove this theorem by reductio ad absurdum. We suppose that
it is impossible to subdivide (a, b) in the way described. We divide our
interval at the centre, obtaining the two intervals:

$$\left(a,\ \frac{a + b}{2}\right) \quad \text{and} \quad \left(\frac{a + b}{2},\ b\right).$$

If the theorem were true for each of these sub-intervals, it would clearly
be true for the whole interval. We must therefore suppose that subdivision
in accordance with the theorem is impossible in at least one of the sub-intervals;
we take this sub-interval and again divide it into two halves. As before, the
theorem is untrue for at least one of these two new halves, and we now divide
this half into two, and so on. We thus obtain a sequence of intervals:

$$(a, b),\ (a_1, b_1),\ (a_2, b_2),\ \ldots,\ (a_n, b_n),\ \ldots$$

where each successive interval is half of the preceding one, so that the length
(b_n − a_n), equal to (b − a)/2^n, tends to zero with increasing n. Further, the
theorem is untrue for every (a_n, b_n), i.e. it is not possible to divide any
(a_n, b_n) into new intervals so that |f(x_2) − f(x_1)| < ε provided x_1 and x_2
are in the same new interval. We show that this is absurd.

By the theorem of [42], a_n and b_n have a common limit:

$$\lim a_n = \lim b_n = \alpha, \tag{30}$$

this limit, like all a_n and b_n, lying in (a, b). We first take α in the interior
of (a, b). By hypothesis, f(x) is continuous for x = α, and hence [34], there
is, for the ε given in the theorem, an η such that, for all x in the interval
(α − η, α + η) the inequality is satisfied:

$$|f(\alpha) - f(x)| < \frac{1}{2}\varepsilon. \tag{31}$$

If x_1 and x_2 are any two values from the interval (α − η, α + η), we have:

$$f(x_2) - f(x_1) = f(x_2) - f(\alpha) + f(\alpha) - f(x_1),$$

whence

$$|f(x_2) - f(x_1)| \le |f(x_2) - f(\alpha)| + |f(\alpha) - f(x_1)|,$$

and by (31):

$$|f(x_2) - f(x_1)| < \frac{\varepsilon}{2} + \frac{\varepsilon}{2},$$

i.e.

$$|f(x_2) - f(x_1)| < \varepsilon \tag{32}$$

for any x_1 and x_2 in (α − η, α + η). But by (30), there will exist an interval
(a_l, b_l) belonging to (α − η, α + η). Hence inequality (32) will certainly be
satisfied for any x_1 and x_2 from this interval, i.e. the theorem is satisfied
for (a_l, b_l) even without further subdivision. This contradicts the fact seen
above, that the theorem is not satisfied for any (a_n, b_n). The theorem is thus
proved, if α is in the interior of (a, b). If, for example, α coincides with the
left-hand end of the interval, i.e. α = a, the proof is the same, except that
the interval (a, a + η) is taken instead of (α − η, α + η).

We now turn to the proof of the third property of [35]. 

THEOREM 2. If f(x) is continuous in the interval (a, b), it is uniformly
continuous in this interval, i.e. for any given positive ε there exists a positive
η, such that |f(x'') − f(x')| < ε for any x' and x'' of (a, b) that satisfy the
inequality |x'' − x'| < η.

By Theorem 1, we can subdivide (a, b) into a finite number of new
intervals in such a way that |f(x_2) − f(x_1)| < ε/2, provided that x_1 and x_2 belong
to the same new interval. Let η be the length of the shortest of the new
intervals. We show that our theorem is in fact satisfied for this η. In fact,
if x' and x'' are two values from (a, b), satisfying |x'' − x'| < η, either
x' and x'' belong to the same new interval or to two neighbouring intervals.
In the first case, we have by the construction of the new intervals:
|f(x'') − f(x')| < ε/2, and hence certainly |f(x'') − f(x')| < ε. In the second
case, we denote by γ the point of contact of the two adjacent intervals
containing x' and x''. We can now write:

$$f(x'') - f(x') = f(x'') - f(\gamma) + f(\gamma) - f(x'),$$

i.e.

$$|f(x'') - f(x')| \le |f(x'') - f(\gamma)| + |f(\gamma) - f(x')|. \tag{33}$$

But

$$|f(x'') - f(\gamma)| < \frac{\varepsilon}{2} \quad \text{and} \quad |f(\gamma) - f(x')| < \frac{\varepsilon}{2}, \tag{34}$$

since x'' and γ are in the same new interval, and similarly for x' and γ. The
inequalities (33) and (34) give us |f(x'') − f(x')| < ε, and the theorem is
proved.

Theorem 1 also gives us the following corollary. 

COROLLARY. If a function is continuous in an interval (a, b), it is bounded
above and below, i.e. simply bounded, in this interval. In other words, there
exists an M, such that |f(x)| < M for all x in (a, b). We take a certain definite
ε_0 and let n_0 be the number of new intervals into which (a, b) must be
subdivided so as to satisfy Theorem 1 for ε = ε_0. We have |f(x_2) − f(x_1)| < ε_0
for any two points belonging to the same new interval. Hence it immediately
follows that |f(x) − f(a)| < n_0 ε_0 for any x of (a, b), i.e. all f(x) are
included between f(a) − n_0 ε_0 and f(a) + n_0 ε_0.

Since the aggregate of all f(x) of (a, b) is bounded above and below, it has
strict upper and lower bounds [42]. We denote the former by β, and the
latter by α. We now prove the first property of [35].

THEOREM 3. A function continuous in an interval (a, b) attains maximum 
and minimum values in this interval. 

We have to show that there exists an x of (a, b), such that f(x) = β, and
similarly an x, such that f(x) = α. We confine ourselves to proving the first
assertion, and do so by a contradiction. We suppose that f(x) is not equal
to β for any x of (a, b) (and hence is less than β). We form a new function:

$$\varphi(x) = \frac{1}{\beta - f(x)}.$$

Since the denominator is never zero, the new function is also continuous
in (a, b) [34]. On the other hand, it follows from the definition of strict upper
bound, that for any ε > 0 there exists an x of (a, b) with f(x) lying between
(β − ε) and β. Now: 0 < β − f(x) < ε and φ(x) > 1/ε. Since ε can be taken
arbitrarily small, we see that φ(x), continuous in (a, b), is not bounded above,
which contradicts the above corollary of Theorem 1.

We finally prove the second property of [35]. Let f(x) be continuous in
(a, b), and k be any number, lying between f(a) and f(b). We take for clarity,
f(a) < k < f(b). We form a new function:

$$F(x) = f(x) - k,$$

continuous in (a, b). Its values at the ends of the interval are:

$$F(a) = f(a) - k < 0, \qquad F(b) = f(b) - k > 0,$$

i.e. F(x) has different signs at the ends of the interval. If we can show that
there is an x_0 in (a, b) such that F(x_0) = 0, we shall have f(x_0) − k = 0,
i.e. f(x_0) = k, and the second property is proved. It is thus sufficient to prove
the following theorem:

THEOREM 4. If f(x) is continuous in (a, b), and if f(a) and f(b) are of opposite
signs, there exists at least one x_0 for which f(x_0) = 0.

We prove the theorem by supposing it false, as in Theorem 3. Let f(x) never
be zero in (a, b). The new function:

$$\varphi(x) = \frac{1}{f(x)} \tag{35}$$

will also be continuous in (a, b) [34]. We use Theorem 1, and take any ε > 0.
We find a finite set of points in (a, b), including its ends, so that the absolute
value of the difference of f(x) for any two neighbouring points is less than
ε. Noting that f(a) and f(b) have different signs, we can find two neighbouring
points ξ_1 and ξ_2 such that the corresponding f(x) have different signs. Thus,
whilst f(ξ_1) and f(ξ_2) have different signs, |f(ξ_1) − f(ξ_2)| < ε. Since f(ξ_1)
and f(ξ_2) are real numbers of opposite signs, it follows that the absolute value
of each is less than ε, i.e. |f(ξ_1)| < ε. But now, by (35), |φ(ξ_1)| > 1/ε, and
since ε can be taken arbitrarily small, it follows that φ(x), continuous in (a, b),
is unbounded in the interval. This is absurd. Thus the theorem is proved.
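
The proof above proceeds by contradiction and does not indicate where the zero
lies. In practice a zero whose existence Theorem 4 guarantees is often located
by repeated halving of the interval. This is a different, constructive
technique, given here only as a hedged sketch; the function name bisect_zero
and the cubic used as an example are assumptions of ours.

```python
def bisect_zero(f, a: float, b: float, tol: float = 1e-12) -> float:
    """Locate a zero of a continuous f on [a, b], given f(a), f(b) of opposite signs.

    At every step the half-interval at whose ends f still differs in sign is kept.
    """
    fa, fb = f(a), f(b)
    if fa == 0:
        return a
    if fb == 0:
        return b
    if (fa < 0) == (fb < 0):
        raise ValueError("f(a) and f(b) must be of opposite signs")
    while b - a > tol:
        mid = (a + b) / 2
        fm = f(mid)
        if fm == 0:
            return mid
        if (fm < 0) == (fa < 0):
            a, fa = mid, fm      # the sign change lies in the right half
        else:
            b = mid              # the sign change lies in the left half
    return (a + b) / 2


def cubic(x: float) -> float:
    """Illustrative function: negative at x = 2, positive at x = 3."""
    return x ** 3 - 2 * x - 5


root = bisect_zero(cubic, 2.0, 3.0)
print(root, cubic(root))
```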


44. Continuity of elementary functions. We proved earlier the continuity
of polynomials and rational functions [34]. We now consider the exponential
function:

$$y = a^x \quad (a > 0), \tag{36}$$

where we take a > 1 for clarity. This function is fully defined for all positive
rational x. It is defined for negative x by the formula:

$$a^{-x} = \frac{1}{a^x}, \tag{37}$$

with also, a^0 = 1. It is thus defined for all rational x. Also, the rules of
addition and subtraction of exponents on multiplication and division are known
from algebra.

If x is a positive rational number p/q,

$$a^{p/q} = \sqrt[q]{a^p},$$

taking the numerical value of the radical. Evidently, a^0 = 1, and it follows
from the definition of a root that a^x > 1 for x > 0 (taking the definition
from [41]). It follows from (37) that 0 < a^x < 1 for x < 0. We now show
that a^{x_2} > a^{x_1} if x_2 > x_1, i.e. that a^x is an increasing function. In fact,


$$a^{x_2} - a^{x_1} = a^{x_1}\left(a^{x_2 - x_1} - 1\right),$$

where x_2 − x_1 > 0, so that both factors on the right are positive. We show
further that a^x → 1 if x → 0, taking rational values. We first suppose that
x → 0, decreasing through all rational values (on the right). Then a^x decreases,
but remains greater than unity, and thus has a limit, which we denote by
l. Moreover, given this variation of x, 2x also tends to zero on the right,
through every rational value. We obviously have:

$$a^{2x} = (a^x)^2,$$

so that, passing to the limit:

$$l = l^2 \quad \text{or} \quad l(l - 1) = 0,$$

i.e. l = 1 or l = 0. The second possibility is ruled out since a^x > 1. Thus,
a^x → 1 as x → 0 on the right. It follows from (37) that we have the same
limit for x → 0 on the left. Thus in general, a^x → 1 if x → 0, taking rational
values. Hence it immediately follows, that if x tends through rational values
to a rational limit b, a^x → a^b. In fact,

$$a^x - a^b = a^b\left(a^{x - b} - 1\right).$$

Here, (x − b) → 0, and by the above, (a^{x−b} − 1) → 0.

We now define function (36) for irrational x. Let α be a certain irrational
number, and let I(α) and II(α) be the first and second classes respectively of
the section in the domain of rational numbers that defines α. We suppose
that x → α, increasing and passing through all the rationals of I(α). Then
a^x is increasing but bounded, being in fact less than a^{x''}, where x'' is any
number of II(α). Thus, for the above variation of x, a^x has a limit, which we
denote for the present by L. Similarly, if x → α, decreasing through the
rationals of II(α), a^x again has a limit. We show that this limit is also L.
Let x' belong to I(α), and x'' to II(α). We have:

$$a^{x''} - a^{x'} = a^{x'}\left(a^{x'' - x'} - 1\right) < L\left(a^{x'' - x'} - 1\right),$$

i.e.

$$0 < a^{x''} - a^{x'} < L\left(a^{x'' - x'} - 1\right).$$

For x' and x'' near α, (x'' − x') is as near zero as desired, and the same
can be said of (a^{x''} − a^{x'}) by the above inequality; whence follows the
coincidence of the limits. We take this limit L as defining a^α, i.e. a^α is the limit
to which a^x tends, when x → α through rational values. Function (36) is now
defined for all real x. It is easy to show from the above, that the function is
increasing, i.e. a^{x_2} > a^{x_1}, if x_1 and x_2 are any real numbers, satisfying x_2 > x_1.
We have to consider the cases separately, of x_1 and x_2 both irrational, or of
one of them rational. We still have to show that the function is continuous
for any real x. We must first show, that a^x → 1 for x → 0, all real values being
permitted. The proof is exactly the same as above, for rational x. Also as
above, using the formula:

$$a^x - a^\alpha = a^\alpha\left(a^{x - \alpha} - 1\right),$$

we can show that a^x → a^α for x → α, which proves that a^x is continuous for
any real x.
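
The definition of a^α for irrational α can be watched numerically. The sketch
below is an illustration under assumptions of ours: a = 2, α = √2, and the
rational exponents x' of I(α) and x'' of II(α) are taken to be the decimal
truncations of √2 and the same truncations with the last figure increased by
unity. Floating-point powers stand in for the exact rational powers, and the
function names are hypothetical.

```python
import math
from fractions import Fraction


def sqrt2_brackets(digits: int):
    """Rationals x' < sqrt(2) < x'' with x'' - x' = 10**(-digits)."""
    scale = 10 ** digits
    # isqrt gives the integer part of sqrt(2) * scale without rounding error.
    lower = Fraction(math.isqrt(2 * scale * scale), scale)
    return lower, lower + Fraction(1, scale)


def power_brackets(a: float, digits: int):
    """Values a**x' and a**x'' enclosing a**sqrt(2) for a > 1."""
    x_lo, x_hi = sqrt2_brackets(digits)
    return a ** float(x_lo), a ** float(x_hi)


for digits in range(1, 9):
    lo, hi = power_brackets(2.0, digits)
    print(digits, lo, hi)
```

The lower values increase, the upper values decrease, and their difference
tends to zero, in agreement with the inequality established above.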

It is easy to verify that all the basic properties of exponential functions 
apply for any real exponents. For instance, let α and β be irrationals, and
let x → α, y → β, x and y varying simultaneously through rational values.
We have for rational exponents:

$$a^x a^y = a^{x + y}.$$

Passing to the limit, by the continuity of exponential functions proved
above, we obtain the same property for irrational exponents:

$$a^\alpha a^\beta = a^{\alpha + \beta}.$$

We also obtain the multiplication rule for raising a power to a power:

$$\left(a^\alpha\right)^\beta = a^{\alpha\beta},$$

as follows: if β = n is a positive integer, the formula follows at once from
the rule for adding exponents on multiplication. If β = p/q is a positive
rational,

$$\left(a^\alpha\right)^{p/q} = \sqrt[q]{\left(a^\alpha\right)^p} = \sqrt[q]{a^{\alpha p}} = a^{\alpha p / q}.$$

The rule follows at once for negative rationals from (37). We now take
β irrational, and let rational r tend to β. By the above:

$$\left(a^\alpha\right)^r = a^{\alpha r}.$$

Passing to the limit, and using the continuity of an exponential function,
with a^α taken as base on the left, we obtain (a^α)^β = a^{αβ}.

Before turning to logarithmic functions, we make a few remarks about
inverse functions, which have already been briefly mentioned in the
introduction [20]. If y = f(x) is increasing and continuous in (a, b), with f(a) = A
and f(b) = B, by the second property of continuous functions, when x
increases from a to b through all real values, f(x) increases from A to B,
passing through all intermediate values. Thus, for every y in (A, B) there
is a corresponding definite x in (a, b), and the inverse function x = φ(y)
will be single-valued and increasing. If x = x_0 is inside (a, b), and x runs
through a small interval (x_0 − ε, x_0 + ε), y will run through a certain
interval (y_0 − η_1, y_0 + η_2), where y_0 = f(x_0). Denoting the least of the two
positive η_1 and η_2 by δ, we can say that, if y belongs to (y_0 − δ, y_0 + δ),
consisting of only part of (y_0 − η_1, y_0 + η_2), x will belong all the more to
the above (x_0 − ε, x_0 + ε), i.e. |φ(y) − φ(y_0)| < ε, provided |y − y_0| < δ.
Since ε is arbitrary, this gives us the continuity of function x = φ(y) at
y = y_0. If x_0 coincides with the end a, for example, we have to take the
interval (x_0, x_0 + ε) in place of (x_0 − ε, x_0 + ε) in the above discussion.
The working is analogous for f(x) continuous and decreasing. 

We return to function (36). Since a > 1, a = 1 + b, where b > 0, and
Newton's binomial formula gives for a positive integer n > 1:

$$a^n = (1 + b)^n > 1 + nb,$$

whence it is clear that a^x increases indefinitely on indefinite increase of
x. Further, it follows from (37) that a^x → 0 for x → −∞. On noting the above
remarks about inverse functions, we can say that the function

$$x = \log_a y, \tag{38}$$

the inverse of (36), is single-valued, increasing, and continuous for y > 0.
The same results are obtained for 0 < a < 1, except that functions (36)
and (38) are decreasing.
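
Since a^x is increasing and continuous for a > 1, the value of log_a y may be
found by enclosing y between two values of a^x and halving the interval of
exponents. The following is a sketch only, resting on the remarks about
inverse functions above; the name log_base is hypothetical, and moderate
values of y are assumed so that the powers do not overflow.

```python
import math


def log_base(a: float, y: float, tol: float = 1e-12) -> float:
    """Approximate log_a(y) for a > 1 and y > 0 by inverting x -> a**x."""
    if a <= 1 or y <= 0:
        raise ValueError("need a > 1 and y > 0")
    lo, hi = -1.0, 1.0
    # Widen the interval of exponents until a**lo <= y <= a**hi.
    while a ** lo > y:
        lo *= 2
    while a ** hi < y:
        hi *= 2
    # Halve it, keeping a**lo <= y <= a**hi; a**x increasing makes this valid.
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if a ** mid < y:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


print(log_base(2.0, 10.0), math.log(10.0, 2.0))
```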

We now introduce a new concept, that of a function of a function. Let
y = f(x) be continuous for a < x < b, with its values lying in (c, d). Further,
let z = F(y) be continuous in the interval c < y < d. On taking the above
function of x as y, we obtain a function of a function of x:

$$z = F(y) = F(f(x)).$$

We say that this function depends on x through the medium of y. It is
defined for a < x < b. It is easily shown to be continuous in this interval.
In fact, an infinitesimal increment of x gives a corresponding
infinitesimal increment of y, since f(x) is continuous, and the infinitesimal
increment of y gives a corresponding infinitesimal increment of z, since F(y)
is continuous.

We now consider the power function

$$z = x^b, \tag{39}$$

with any real exponent b, and taking positive values of x. It immediately
follows from the discussion of exponential functions that the value of (39)
is defined for every x > 0. Using the definition of logarithm and taking
natural logarithms, for example, we can write instead of (39):

$$z = e^{b \log x}.$$

Putting y = b log x, and z = e^y, we can consider the above as a function
of a function of x, and the continuity of (39) for every x > 0 follows from
the continuity of exponential and logarithmic functions.
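
The representation of the power function as a function of a function can be
written out directly. The sketch below is an illustration only; the name power
is hypothetical, and the library logarithm and exponential are used in place
of the functions constructed in the text.

```python
import math


def power(x: float, b: float) -> float:
    """Compute x**b for x > 0 as the composite z = e**y with y = b log x."""
    if x <= 0:
        raise ValueError("the power function is considered here for x > 0 only")
    y = b * math.log(x)   # inner function y = b log x
    return math.exp(y)    # outer function z = e**y


print(power(2.0, 0.5), math.sqrt(2.0))
```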

The continuity of the trigonometric functions is easily shown by using 
formulae of elementary trigonometry. From the formula: 


sin (w+ h) — sin z = 2 sin Ż cos (e+) 


we have: 


[sin (x -+ h) — sing |< 2 





si ES 
n 3 , 
since | cos (x + h/2)| < 1. But for any angle a: | sina| < |a], so that 
| sin (x + A) ~ sinz|« | ^| 


and thus the left-hand side tends to zero as h → 0, giving us the continuity 
of sin x for all x. The continuity of cos x for all x is similarly proved. The 
continuity of tan x and cot x for all x, except those for which the denominators 
in the formulae below become zero, follows at once [34] from: 


$$\tan x = \frac{\sin x}{\cos x}, \qquad \cot x = \frac{\cos x}{\sin x}.$$
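
The chain of inequalities used above for the sine, namely that the increment of sin x is at most 2|sin(h/2)| and this in turn is at most |h|, can be checked at a few points. The helper names and the sample point x = 1 are assumptions of this sketch.

```python
import math

def sine_increment(x: float, h: float) -> float:
    """Return |sin(x + h) - sin(x)|."""
    return abs(math.sin(x + h) - math.sin(x))

def half_angle_bound(h: float) -> float:
    """Return the intermediate bound 2 |sin(h / 2)|."""
    return 2.0 * abs(math.sin(h / 2.0))

if __name__ == "__main__":
    x = 1.0  # arbitrary sample point (an assumption of this sketch)
    for h in (1.0, 0.1, 0.01, 0.001):
        # Columns: h, increment of the sine, intermediate bound, final bound |h|.
        print(h, sine_increment(x, h), half_angle_bound(h), abs(h))
```

Each printed row is non-decreasing from the second column to the fourth, and all three shrink together with h.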





The function y = sin x is continuous and increasing in the interval 
(−π/2, π/2). Using the above remarks concerning inverse functions, we 
can assert that the principal value of the function x = arc sin y [24] is a 
continuous, increasing function in the interval −1 < y < 1. The proofs 
are similar for the continuity of the remaining inverse circular functions. 

EXERCISES ON CHAPTER I 
1. Prove that, if a and b are real numbers: 
$$\bigl|\, |a| - |b| \,\bigr| \le |a - b| \le |a| + |b|.$$
2. Prove the following equalities: 


(a) =a — 0) Dm 





b 
(c) |a P=; (d) ya? — |a]. 
3. Determine the range of values of x for which: 
(a) |x − 1| < 3; (b) |2x − 1| < 1; 
(c) |x + 1| > 2; (d) |x − 1| < |x + 1|. 


4. Find f(−1), f(0), f(1), f(2), f(3), f(4) if f(x) = x³ − 6x² + 11x − 6. 
5. Find f(0), f(— 3), f(—2), far Nus )I if fz) = Vr 
6. Let f(x) = in bos Pn Yid fà (5) FA), f(10). 
7. The function f(x) is linear. Find the function if f(—1) — 2 and 
fQ) = — 
8. Find the formula for the quadratic function f(x) for which 
f(0) = 1, f(1) = 0 and f(3) = 5. 
9. It is given that f(4) = −2, f(5) = 6. Find the approximate 
value of f(4.3) by assuming that the function f(x) can be 
represented by a linear function for 4 < x < 5 (linear 
interpolation formula). 
10. Find a single formula by means of which to express the function: 


0ifx<0 
Xx) == 
he) E m 


Define the domains in which the following functions exist: 


11. (a) y = Yx +1; ( 5) y= Vet. 
12. y = 1/(4 — x”). 

13. (a) y = a? — 2; (b) y= x Vx? — 2. 
14. y = /2 + x — rê. 

15. y =Y—<+1//2+3x. 

16. y = V(x — a?). 

17. y = log_10 [(2 + x)/(2 − x)]. 

18. y = log_10 [(x² − 3x + 2)/(x + 1)]. 

19. y = arc cos [2x/(x + 1)]. 

20. y = arc sin log_10 (x/10). 

21. Define the domain in which the function y = √(sin 2x) exists. 

22. f(x) = 2x⁴ − 3x³ − 5x² + 6x − 10. Find φ(x) = ½ [f(x) + f(−x)] 
and ψ(x) = ½ [f(x) − f(−x)]. 

23. Find φ[ψ(x)] and ψ[φ(x)], if φ(x) = x² and ψ(x) = 2^x. 

24. Find f(f(f(x))) if f(x) = 1/(1 − x). 

25. Find f(z + 1) if f(x — 1) = z. 

26. Let f(n) be the sum of n terms of an arithmetical progression. 
Show that f(n + 3) − 3f(n + 2) + 3f(n + 1) − f(n) = 0. 

27. Prove that if f(x) is an exponential function, i.e. f(x) = a^x 
(a > 0), and if x_1, x_2, x_3 form an arithmetical progression, then 
f(x_1), f(x_2), f(x_3) form a geometrical progression. 

28. Let f(x) = log [(1 + x)/(1 − x)]. Show that 


$$f(x) + f(y) = f\left(\frac{x + y}{1 + xy}\right).$$


29. Let φ(x) = ½ (a^x + a^{−x}) and ψ(x) = ½ (a^x − a^{−x}). Show that 
φ(x + y) = φ(x) φ(y) + ψ(x) ψ(y) and ψ(x + y) = φ(x) ψ(y) + φ(y) ψ(x). 
30. Find f(—1), f(0), f(1) if 





are sin x for— co«z«0 


f(x) = , 
arc tan z for 0 « z « + œ 


31. Find the zeros and the domains in which the following functions 
are respectively positive and negative: 


(a) y=1+x; b) y=24+%-2% (©) y= 1l—a#4+ 2; 
(d) y= 2? — 3x; (e) y = log, .[2x/(1 + 2)]. 


32. Find the inverses of the following functions: 
3 
(a) y= 2x 4-3; (b) y= —Y (0) y= y1— xz; 
(d) y = log, (3%); (e) y = arc tan 3x. 
33. Find the inverse of the function defined by: 


"T xifx<o0 
aifx>o 
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34. Write the following functional relationships in the form of a 
chain of equations the elements of which are simple functions 


(eg. y= (1+ iy could be written as y = u", u=1+0, 


1 
v= zN 
(a) y = (2x — 530; (b) y — 27**; (e) y — log, tan xx; 
(d) y = are sin (3 *). 
35. From each of the following chains of equations form a com- 
pound function written in the form of a single equation: 
(a) y= w, u= sing; (b) y = arctan u, u= Vo, v = log,yt; 
(c) y = 2u if u < 0, y = 0 if u > 0, u = 22 — 1. 
36. Write down in explicit form the functions y satisfying the 
following equations. 
(a) a3? — are cos y = m; (b) 10% + 10" = 10; (c) x + ly| = 2. 
Find the domain of definition of each of the functions. 
Construct the graphs of the bilinear functions (hyperbolas): 
37. y = ljx. 
38. y = 1/(1 — 2). 
39. y — (x — 2)/(x + 2). 
40. y = yo + m/(x—%) if 2, — 1, yy = —1, m = 6. 
41. y = (2x — 3)/(3x + 2). 
Construct the graphs of the rational functions: 
42. y = x + Ife. 43. y = (a + 1). 44. y = 1/22. 45. y = 1/27. 
46. y = 10/(z? + 1) (Witch of Agnesi). 47. y = 2x/(2? + 1). 
48. y = (x + 1)/x?. 49. y = (2? + 1)/x. 
Construct the graphs of the trigonometric functions: 
50. y = sin z. 51. y = cos x. 52. y = tana. 53. y = cot x. 
54. y = sec x. 55. y = cosec x. 56. y = A sin x with A = 1, 10, 1/2,— 2. 
57. y = sin nz with n= 1l, 2, 3, 5 98. y = sin (x — p), with 
p= 0, im, ix, a,— im. 99. y=asinx+bcosx, with a= 6, 
b= —8. 60. y = sin x + cos x. 61. y = cos? x. 62. y = x + sin a. 
63. y = x sin x. 
Construct the graphs of the exponential and logarithmic functions: 
64. y = a^ with a = 2, 3, e (= 2.718.. )). 
65. y = log¿x, with a = 10, 2, 3, e. 
66. y = sinh z =3(e* — e 7). 67. y = cosh x = 1(e* +e). 
68. y = tanh x = sinh z/cosh x. 69. y = 10", 
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70. y = e^* (error curve). 71. y = 27! *, 72, y = loggt. 
73. y = (log;ox)?. 74. y = log,y(log,p2). 75. y = Iflog, x. 
76. y = log, (I/x). 77. y = log,.(—a). 78. y = log,(1 + 2). 
79. y = log,,(cos x). 80. y = 2 * sin x. 

81. Prove that as n-—> oo the limit of the sequence 


is equal to zero. For what values of n will we have the inequality 
n? < e where e is an arbitrary positive number. 

Give an estimate of the lowest such n for (a) e = 0.1; (b) e = 0.01; 
(c) e = 0.001. 

82. Prove that the limit of the sequence x_n = n/(n + 1) 
(n = 1, 2, ...) as n → ∞ is equal to 1. If ε is an arbitrary positive 
number find N such that |x_n − 1| < ε for n > N. Find N 
if (a) ε = 0.1; (b) ε = 0.01; (c) ε = 0.001. 

83. Prove that 


$$\lim_{x \to 2} x^2 = 4.$$


Given a positive number ε find a positive number δ such that 
|x² − 4| < ε whenever |x − 2| < δ. Calculate δ if (a) ε = 0.1; 
(b) ε = 0.01; (c) ε = 0.001. 
84. Write out precisely what is meant by the following statements: 
(a) lim logiaz = —œ; (b) lim 2* = 4-00; (c) lim f(x) = co. 
x>+0 


x> x= o 


85. Find the limits of the sequences: 


1 1 1 (— 11 . 

DR cab n MISI C EDT 
2 4 6 2n i 
(b) i , 3 , TUB mni jeres; 


() V3 V272 V222... 


(d) 0.2, 0.23, 0.233, 0.2333, ... 





86. lim (Exe wo J. 


n? 


87. lim — (n + 1) (n + 2) (n + 3). 





lim f 143 +5+7+...+ (2n—1) n+l 
ag, lim | c QE IE 
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89. 


90. 


91 


92. 


93. 


94. 


96 


98 


100. 


102. 


104. 


106. 


108. 


110. 


112. 


114. 
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n 
lim eae EN 
noo 8 (=I) 
. gni. gota 
lu reum 
A 1 1 1 1 
lim (+ ht + e): 
A 1 1 1 (— 1): 
lim [1 r H + Spe] 


n= œ 


lim PEZH e 




















nao ns 

. lim ^"sina! 
1m (/n +1 — Yn). 9 nce uU 
lim (zd 1} im Oe 
peta io 97. um x2 — 1 
at be + 1 in AS 
ETE A E 
lim p 101. lim /—1 
x eVez M Mae 

— >. 

lim Den 103. lim dum ; 

x64 lz—4 xl jz—1 
3 3_ a 

lim (2/2 +1 105. lim 2—/2—3 
x1 (@— I ee ee 

—8 zx 

lim -2 107. lim 2 —1 
^ jz— 2 2d Vs —1 

lim 32/92 109. lim Eure e 
x>4 1— y5 — t x0 E 

1 ee L 

lim Nah h— Vo 11. lim 2 +» —Ve 
h=0 noo h 


lim (fafa — yz) 113. lim [Vz (s Fa) — 2] 


lim [Va — 5z + 6—a] 115 lim « [Yat 1 — ] 
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116. lim [+ 1—35] 117. (a) lim 222 ; (b) lim 22 
xo x~2 Xo 
n sin 3x . sin 5x 
118. ium 119. lim A 
120. lim m 121. lim n sin — 
x1 NE ne n 
122, lim -1— 595? 123. lim 522 — sina 
x0 T xa za 
— s l— siniz 
124. lim 2952 — 2089 125. Z 
sd z—a RL 
126. lim 1 — 2cos v 127. lim! I+sing —/1— sinz 
1 n —- 3x x0 x 
x 37 
3 1 P 2 y 
8. ES a 2 
12 lim (1 =) 129 lim (1 +=} 
z c x > 2— 1 y 
130. lim LT) 131. lim ( LIS ) 
n 
132. lim (1 En =| 133. lim (1 + sin x) Y* 
n-»o X>0o 


134. Show that the function y = x² is continuous for any value of 
the argument x. 

135. Prove that the polynomial P(x) = a_0 x^n + a_1 x^{n−1} + ... + a_n is 
continuous for all values of x. 

136. Prove that the rational function 
R(x) = (a_0 x^n + a_1 x^{n−1} + ... + a_n)/(b_0 x^m + b_1 x^{m−1} + ... + b_m) 
is continuous for all values of x except those for which the denominator vanishes. 

137. Prove that the function y = √x is continuous for x ≥ 0. 

138. Prove that if the function f(x) is continuous and non-negative 
in the interval (a, b) the function F(x) = √f(x) is continuous 
in that interval. 

139. Prove that the function y = cos x is continuous for all values 
of x. 

140. For what values of x are the functions (a) tan x, (b) cot x con- 
tinuous? 

141. Prove that the function y = | x | is continuous for all x. Con- 


struct the graph of this function. 
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142. Prove that the absolute value of a continuous function is a 
continuous function. 

143. A function is given by the formula: 

$$f(x) = \begin{cases} \dfrac{x^2 - 4}{x - 2} & \text{if } x \ne 2, \\ A & \text{if } x = 2. \end{cases}$$

How should we choose the value of A = f(2) in order that the 
function f(x) will be continuous at x = 2? Sketch the graph 
of the function y = f(x). 

144. The function f(x) is not defined for x = 0. Define f(0) in order 
that f(x) will be continuous at x = 0 if: 
(a) fle) = EI + 2)" — 1]; (b) fle) = (1 — cos 2)/a2; 
(c) fle) = log (1 +z) — log (1 —2)]; (d) fle) = He — e73); 
(e) f(x) = a? sin (1/x); (f) fa) = x cot x. 




CHAPTER II 


DIFFERENTIATION: THEORY AND APPLICATIONS 


§ 3. Derivatives and differentials 
of the first order 


45. The concept of derivative. We consider a point moving in a 
straight line. The path s traversed by the point, measured from some 
definite point of the line, is evidently a function of time t: 


$$s = f(t).$$

A corresponding value of s is defined for every definite value of t. 
If t receives an increment Δt, the path s + Δs will then correspond 
to the new instant t + Δt, where Δs is the path traversed in the 
interval Δt. In the case of uniform motion, the increment of path 
is proportional to the increment of time, and the ratio Δs/Δt represents 
the constant velocity of the motion. This ratio is in general 
dependent both on the choice of the instant t and on the increment 
Δt, and represents the average velocity of the motion during the 
interval from t to t + Δt. This average velocity is the velocity of 
an imaginary point which moves uniformly and traverses path Δs 
in time Δt. For example, we have in the case of uniformly accelerated 
motion: 


$$s = \frac{1}{2} g t^2 + v_0 t$$

and 

$$\frac{\Delta s}{\Delta t} = \frac{\frac{1}{2} g (t + \Delta t)^2 + v_0 (t + \Delta t) - \frac{1}{2} g t^2 - v_0 t}{\Delta t} = g t + v_0 + \frac{1}{2} g \Delta t.$$





The smaller the interval of time Δt, the more we are justified in 
taking the motion of the point in question as uniform in this interval, 
and the limit of the ratio Δs/Δt, with Δt tending to zero, defines 
the velocity v at the given instant t: 


$$v = \lim_{\Delta t \to 0} \frac{\Delta s}{\Delta t}.$$

Thus, in the case of uniformly accelerated motion: 


$$v = \lim_{\Delta t \to 0} \frac{\Delta s}{\Delta t} = \lim_{\Delta t \to 0} \left( g t + v_0 + \frac{1}{2} g \Delta t \right) = g t + v_0.$$

The velocity v, like the path s, is a function of t; this function 
is called the derivative of function f(t) with respect to t; thus, the 
velocity is the derivative of the path with respect to time. 
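
The passage from average to instantaneous velocity for uniformly accelerated motion can be followed numerically. The values g = 9.81 and v_0 = 2.0 and the function names are assumptions chosen only for this sketch.

```python
def path(t: float, g: float = 9.81, v0: float = 2.0) -> float:
    """Path s = (1/2) g t^2 + v0 t of uniformly accelerated motion."""
    return 0.5 * g * t * t + v0 * t

def average_velocity(t: float, dt: float, g: float = 9.81, v0: float = 2.0) -> float:
    """Ratio of the path increment to the time increment dt (dt != 0)."""
    return (path(t + dt, g, v0) - path(t, g, v0)) / dt

def instantaneous_velocity(t: float, g: float = 9.81, v0: float = 2.0) -> float:
    """Limit of the average velocity as dt tends to zero: g t + v0."""
    return g * t + v0

if __name__ == "__main__":
    t = 1.0  # sample instant (assumed)
    for dt in (0.5, 0.1, 0.01, 0.001):
        # The difference of the two printed values equals (1/2) g dt up to rounding.
        print(dt, average_velocity(t, dt), instantaneous_velocity(t))
```

The gap between the two velocities is the term ½ g Δt of the formula above, so it shrinks in proportion to Δt.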

Suppose that a substance takes part in a chemical reaction. The 
quantity x of this substance, taking part in the reaction at the 
instant t, is a function of t. There is a corresponding increment Δx 
of magnitude x for an increment of time Δt, and the ratio Δx/Δt 
gives the average speed of the reaction in the interval Δt, whilst the 
limit of this ratio as Δt tends to zero gives the speed of the chemical 
reaction at the given instant t. 

We considered above [32] the quantity of heat Q, absorbed by a 
body, as a function of its temperature t°. Let Δt° and ΔQ be the 
corresponding increments of temperature and quantity of heat. 
Accurate measurements indicate that ΔQ is not proportional to Δt°, 
and the ratio ΔQ/Δt° gives the so-called average specific heat of the 
body in the temperature interval from t° to t° + Δt°, whilst the 
limit of this ratio as Δt° tends to zero gives the specific heat of 
the body at t°, this being the derivative of the quantity of heat 
with respect to temperature. 

The above examples lead us to the following concept of the derivative 
of a function: 

The derivative of a given function y = f(x) is defined as the limit 
of the ratio of the increment Δy of the function to the corresponding 
increment Δx of the independent variable, when the latter tends to zero. 

The symbols y' or f'(x) are used to denote the derivative: 


$$y' = f'(x) = \lim_{\Delta x \to 0} \frac{\Delta y}{\Delta x} = \lim_{\Delta x \to 0} \frac{f(x + \Delta x) - f(x)}{\Delta x}.$$

The operation of finding the derivative is called differentiation. 

It is possible for the above limit not to exist, in which case the 
derivative does not exist. Assuming that the derivative exists, we 
can write: 


$$\frac{f(x + \Delta x) - f(x)}{\Delta x} = f'(x) + \alpha,$$


where α → 0 as Δx → 0 [27]. 

We now have: 

$$f(x + \Delta x) - f(x) = [f'(x) + \alpha] \Delta x,$$


whence it is immediately clear that [f(x + Δx) − f(x)] → 0 if 
Δx → 0, i.e. if the derivative exists for some value of x, the function 
is continuous for this value of x. The converse statement does not 
hold, i.e. nothing can be said about the existence of a derivative 
from the continuity of a function. We note that in finding the derivative 
we take the fraction Δy/Δx, with the numerator and denominator 
both tending to zero; but we suppose 
that Δx never in fact becomes zero. 


46. Geometrical significance of the derivative. We turn to the graph of 
the function y = f(x) to see the geometrical meaning of the derivative. 
We take a point M of the graph with coordinates (x, y), and 
an adjacent point N of the graph with coordinates (x + Δx, y + Δy). We 
draw the ordinates M_1M and N_1N of these points, and take a 
line through M parallel to axis OX. We evidently have (Fig. 50): 


$$MP = M_1 N_1 = \Delta x, \quad M_1 M = y, \quad N_1 N = y + \Delta y, \quad PN = \Delta y. \qquad (1)$$


The ratio Δy/Δx is clearly equal to the tangent of the angle α_1 
that MN forms with the positive direction of OX. As Δx tends to 
zero, point N will tend to point M, whilst remaining on the curve; 
MN becomes, in the limiting position, the tangent MT to the curve 
at the point M; hence, the derivative f'(x) is equal to the tangent of 
the angle α formed by the tangent to the curve at the point M(x, y) with 
the positive direction of axis OX, i.e. is equal to the slope of this 
tangent. 

Attention must be paid to the rule of signs when working out the 
segments in accordance with formula (1), remembering that increments 
Δx and Δy can be either negative or positive. 

We see that the existence of a derivative f'(x) is bound up with 
the existence of a tangent to the curve corresponding to y = f(x). 
A continuous curve may have no tangent at all at certain points, 
or it may have a tangent parallel to axis OY, with infinitely large 
slope (Fig. 51); the function f(x) then has no derivative for the 
corresponding value of x. 

A curve can have any number of such singular points, and it is 
even possible to construct a continuous function, as may be shown, 
such that it has no derivative for any value of x. The curve correspond- 
ing to this function cannot be represented geometrically. 

Denoting for simplicity the increment of the independent variable by h, we 
have the ratio: 

$$\frac{f(x + h) - f(x)}{h}. \qquad (2)$$


If the number x is fixed in the interval in which f(x) is defined, the 
ratio (2) is a function of h, defined for all h sufficiently 
close to zero, except h = 0. The limit of this ratio as h → 0 has to be determined 
in accordance with what was said in [32]. If the limit exists, it gives 
us the derivative f'(x). The existence of the limit is equivalent to the 
following [32]. For any given positive ε there exists a positive η such that 


$$\left| \frac{f(x + h) - f(x)}{h} - f'(x) \right| < \varepsilon \quad \text{for } |h| < \eta \text{ and } h \ne 0.$$

Fig. 51 





It can happen that ratio (2) has a limit for h tending to zero on the 
side of positive values (on the right), and on the side of negative 
values (on the left). These limits are usually denoted by f'(x + 0) 
and f'(x— 0), being called respec- 
tively the derivative on the right and 
the derivative on the left. If these 
limits differ, they give the slopes 
of the tangents to the curve at its 
bend point (if the tangents exist). 
Figure 51 shows these tangents at 
the point M. 
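
One-sided difference quotients can be computed separately to see a bend point numerically. The sketch below uses y = |x| at x = 0 as an assumed example (it is not the curve of Fig. 51) and a fixed small h in place of the limit.

```python
def one_sided_quotients(f, x: float, h: float = 1e-6):
    """Return the difference quotients of f at x taken on the right and on the left.

    The fixed step h > 0 stands in for the limit h -> 0; both values are
    therefore approximations to f'(x + 0) and f'(x - 0).
    """
    right = (f(x + h) - f(x)) / h
    left = (f(x - h) - f(x)) / (-h)
    return right, left

if __name__ == "__main__":
    # |x| at 0: the quotients on the right and on the left differ.
    print(one_sided_quotients(abs, 0.0))
    # x^2 at 1: both quotients are close to the same value.
    print(one_sided_quotients(lambda x: x * x, 1.0))
```

For |x| the two quotients stay apart however small h is taken, whereas for x² they approach a common value.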

The existence of the derivative is equivalent to the existence of 
derivatives f'(x + 0) and f'(x − 0) and to the fact of these being equal, 
so that we have f'(x) = f'(x + 0) = f'(x − 0). 
It is possible for a continuous function to have points for which 
there is neither derivative f'(x + 0) nor f'(x − 0). Such a curve is 
shown in Fig. 52. It has neither derivative for x = c. 

Fig. 52 

If a continuous function is given solely in an interval (a, b), we can 
only form the derivative on the right, f'(a + 0), at x = a, and only 
the derivative on the left, f'(b − 0), at x = b. When f(x) is said 
to have a derivative f'(x) in the (closed) interval (a, b), this must be 
taken to mean the derivative in the ordinary sense for interior points 
of the interval, and in the special sense indicated at the ends of the 
interval. 

If f(x) is defined in the interval (A, B), wider than (a, b), i.e. 
A < a and B > b, and has the ordinary derivative f'(x) inside (A, B), 
it will certainly have a derivative in the sense indicated over (a, b). 


47. Derivatives of some simple functions. It follows from the concept 
of derivative that, to find the derivative, the increment given to the 
function must be divided by the corresponding increment of the independent 
variable, the limit of their ratio then being found as the increment of the 
independent variable tends to zero. We use this rule for some elementary 
functions. 

I. y = b (constant) [12]. 


$$y' = \lim_{h \to 0} \frac{b - b}{h} = \lim_{h \to 0} 0 = 0,$$





i.e. the derivative of a constant is zero. 
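II. y = x^n (n a positive integer). 


$$\begin{aligned} y' &= \lim_{h \to 0} \frac{(x + h)^n - x^n}{h} = \lim_{h \to 0} \frac{x^n + n h x^{n-1} + \frac{n(n-1)}{2!} h^2 x^{n-2} + \ldots + h^n - x^n}{h} \\ &= \lim_{h \to 0} \left[ n x^{n-1} + \frac{n(n-1)}{2!} h x^{n-2} + \ldots + h^{n-1} \right] = n x^{n-1}. \end{aligned}$$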


In particular, if y = x, y’ = 1. We later generalize this rule for 
differentiation of a power function for any value of the exponent n. 
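
Rule II can be compared with the difference quotient for shrinking h. The exponent n = 5, the point x = 1.5 and the helper names are assumptions of this sketch.

```python
def difference_quotient(f, x: float, h: float) -> float:
    """Ratio of the increment of f to the increment h of the argument (h != 0)."""
    return (f(x + h) - f(x)) / h

def power_rule(x: float, n: int) -> float:
    """Derivative of x^n according to Rule II: n x^(n-1)."""
    return n * x ** (n - 1)

if __name__ == "__main__":
    n, x = 5, 1.5  # assumed sample values
    for h in (0.1, 0.01, 0.001, 1e-6):
        print(h, difference_quotient(lambda t: t ** n, x, h), power_rule(x, n))
```

The remaining gap is dominated by the term n(n−1)/2! · h · x^{n−2} of the binomial expansion, which vanishes with h.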
III. y = sin x. 


$$y' = \lim_{h \to 0} \frac{\sin (x + h) - \sin x}{h} = \lim_{h \to 0} \frac{2 \cos \left(x + \frac{1}{2} h\right) \sin \frac{1}{2} h}{h} = \lim_{h \to 0} \cos \left(x + \frac{1}{2} h\right) \frac{\sin \frac{1}{2} h}{\frac{1}{2} h} = \cos x,$$


since sin(½h)/(½h) → 1 for ½h → 0 [33]. 
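IV. y = cos x. 


$$y' = \lim_{h \to 0} \frac{\cos (x + h) - \cos x}{h} = \lim_{h \to 0} \frac{-2 \sin \left(x + \frac{1}{2} h\right) \sin \frac{1}{2} h}{h} = -\lim_{h \to 0} \sin \left(x + \frac{1}{2} h\right) \frac{\sin \frac{1}{2} h}{\frac{1}{2} h} = -\sin x.$$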
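V. y = log x (x > 0). 


$$y' = \lim_{h \to 0} \frac{\log (x + h) - \log x}{h} = \lim_{h \to 0} \frac{\log \left(1 + \frac{h}{x}\right)}{h} = \lim_{h \to 0} \frac{1}{x} \cdot \frac{\log \left(1 + \frac{h}{x}\right)}{\frac{h}{x}} = \frac{1}{x},$$


since for h → 0, α = h/x also tends to 0, and log (1 + α)/α → 1 [38]. 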

VI. y = cu(x), where c is a constant, and u(x) a function of x. 

$$y' = \lim_{h \to 0} \frac{c u(x + h) - c u(x)}{h} = c \lim_{h \to 0} \frac{u(x + h) - u(x)}{h} = c u'(x),$$


i.e. the derivative of the product of a constant and a variable is equal 
to the product of the constant and the derivative of the variable, or, in 
other words, the constant can be taken outside the sign of the derivative. 
VII. y = log_a x. 
As we know, log_a x = log x · (1/log a) [38]. Using Rule VI, we obtain: 


$$y' = \frac{1}{x} \cdot \frac{1}{\log a}.$$


VIII. We consider the derivative of the sum of several variables; 
we confine ourselves to three terms for clarity: 


$$y = u(x) + v(x) + w(x),$$


$$\begin{aligned} y' &= \lim_{h \to 0} \frac{[u(x + h) + v(x + h) + w(x + h)] - [u(x) + v(x) + w(x)]}{h} \\ &= \lim_{h \to 0} \left[ \frac{u(x + h) - u(x)}{h} + \frac{v(x + h) - v(x)}{h} + \frac{w(x + h) - w(x)}{h} \right] = u'(x) + v'(x) + w'(x), \end{aligned}$$

i.e. the derivative of the sum of any given number of functions is equal 
to the sum of the derivatives of these functions. 

IX. We now consider the derivative of the product of two functions: 

$$y = u(x) v(x),$$


$$y' = \lim_{h \to 0} \frac{u(x + h) v(x + h) - u(x) v(x)}{h}.$$

Adding and subtracting u(x + h) v(x) in the numerator, then 
re-arranging, we get: 


$$\begin{aligned} y' &= \lim_{h \to 0} \frac{u(x + h) v(x + h) - u(x + h) v(x) + u(x + h) v(x) - u(x) v(x)}{h} \\ &= \lim_{h \to 0} u(x + h) \frac{v(x + h) - v(x)}{h} + \lim_{h \to 0} v(x) \frac{u(x + h) - u(x)}{h} = u(x) v'(x) + v(x) u'(x), \end{aligned}$$


i.e. we have shown that for two factors, the derivative of the product 
is equal to the sum of the products of each factor with the derivative of 
the other. 

We prove the applicability of this rule to three factors by combining 
two factors in one group and using the rule for two: 

$$y = u(x) v(x) w(x),$$

$$\begin{aligned} y' &= \{[u(x) v(x)] w(x)\}' = [u(x) v(x)] w'(x) + w(x) [u(x) v(x)]' \\ &= u(x) v(x) w'(x) + u(x) v'(x) w(x) + u'(x) v(x) w(x). \end{aligned}$$

Using the well-known method of mathematical induction, the 
rule for two can easily be extended to the case of any finite number of 
factors. 

X. Now let y be a quotient: 


$$y = \frac{u(x)}{v(x)}, \qquad y' = \lim_{h \to 0} \frac{\dfrac{u(x + h)}{v(x + h)} - \dfrac{u(x)}{v(x)}}{h} = \lim_{h \to 0} \frac{1}{v(x) v(x + h)} \cdot \frac{u(x + h) v(x) - v(x + h) u(x)}{h}.$$


Adding and subtracting u(x)v(x) in the numerator of the second 
fraction, and taking the continuity of v(x) into account, we get: 


$$\begin{aligned} y' &= \lim_{h \to 0} \frac{1}{v(x) v(x + h)} \cdot \frac{u(x + h) v(x) - u(x) v(x) + u(x) v(x) - v(x + h) u(x)}{h} \\ &= \lim_{h \to 0} \frac{1}{v(x) v(x + h)} \left[ v(x) \frac{u(x + h) - u(x)}{h} - u(x) \frac{v(x + h) - v(x)}{h} \right] = \frac{u'(x) v(x) - v'(x) u(x)}{[v(x)]^2}, \end{aligned}$$

i.e. the derivative of a fraction (quotient) is equal to the denominator 
times the derivative of the numerator, minus the numerator times the 
derivative of the denominator, all divided by the square of the denominator. 
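
Rules IX and X can be checked against a numerical derivative. The sketch assumes u(x) = sin x and v(x) = x² + 1 as sample functions, and it uses a symmetric (central) difference quotient instead of the one-sided ratio of the text, purely because it is more accurate at a fixed step.

```python
import math

def numerical_derivative(f, x: float, h: float = 1e-6) -> float:
    """Central difference quotient; an approximation, not the limit itself."""
    return (f(x + h) - f(x - h)) / (2.0 * h)

# Assumed sample functions and their known derivatives.
def u(x): return math.sin(x)
def du(x): return math.cos(x)
def v(x): return x * x + 1.0
def dv(x): return 2.0 * x

def product_rule(x: float) -> float:
    """Rule IX: (u v)' = u v' + v u'."""
    return u(x) * dv(x) + v(x) * du(x)

def quotient_rule(x: float) -> float:
    """Rule X: (u / v)' = (u' v - v' u) / v^2."""
    return (du(x) * v(x) - dv(x) * u(x)) / v(x) ** 2

if __name__ == "__main__":
    x = 0.7  # assumed sample point
    print(product_rule(x), numerical_derivative(lambda t: u(t) * v(t), x))
    print(quotient_rule(x), numerical_derivative(lambda t: u(t) / v(t), x))
```

Since v(x) = x² + 1 never vanishes, the quotient is defined for every x in this example.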
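XI. y = tan x. 

$$y' = \left( \frac{\sin x}{\cos x} \right)' = \frac{(\sin x)' \cos x - (\cos x)' \sin x}{\cos^2 x} = \frac{\cos^2 x + \sin^2 x}{\cos^2 x} = \frac{1}{\cos^2 x}.$$

XII. y = cot x. 

$$y' = \left( \frac{\cos x}{\sin x} \right)' = \frac{(\cos x)' \sin x - (\sin x)' \cos x}{\sin^2 x} = \frac{-\sin^2 x - \cos^2 x}{\sin^2 x} = -\frac{1}{\sin^2 x}.$$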


In deducing Rules VI, VIII, IX and X, we assumed that the 
functions u(x), v(x), w(x) have derivatives, and proved the existence 
of a derivative of the function y. 


48. Derivatives of functions of a function, and of inverse functions. 
We recall the concept of function of a function [44]. Let y = f(x) 
be a function, continuous in some interval a < x < b, with its value 
lying in the interval c < y < d. Further, let z = F(y) be a function 
continuous in the interval c < y < d. Taking the above function 
of x as y, we obtain a function of a function (composite function) 
of x: 


$$z = F(y) = F(f(x)).$$

This function is said to depend on x through the medium of y. 
It is easily seen that the function is continuous in the interval 
a < x < b. In fact, given an infinitesimal increment of x, there 
is a corresponding infinitesimal increment of y, due to the continuity 
of f(x), and given the infinitesimal increment of y, there 
is a corresponding infinitesimal increment of z, due to the continuity 
of F(y). 

We make one remark, before deducing the rule for differentiation 
of a function of a function. If z = F(y) has a derivative for y = y_0, 
it follows from the above [45], that we can write: 


$$\Delta z = F(y_0 + \Delta y) - F(y_0) = [F'(y_0) + \alpha] \Delta y, \qquad (3)$$


where α is a function of Δy, defined for all non-zero values of Δy sufficiently 
close to zero, and where α → 0 as Δy → 0 (Δy ≠ 0). Equation (3) remains 
valid for Δy = 0 with any choice of α, since for Δy = 0, Δz 
also = 0. It is natural from the above to take α = 0 for Δy = 0. 
Having agreed on this, we can take α → 0 in formula (3) for Δy → 0 
in any manner, even whilst taking values equal to zero. We now 
formulate the theorem about the derivative of a function of a function. 

THEOREM. If y = f(x) has a derivative f'(x_0) at x = x_0 and z = F(y) 
has a derivative F'(y_0) at y_0 = f(x_0), the function of a function F(f(x)) 
has a derivative at x = x_0 equal to the product F'(y_0) f'(x_0). 

Let Δx be the increment (not zero) that we give to the value x_0 
of the independent variable x, and Δy = f(x_0 + Δx) − f(x_0) be the 
corresponding increment of variable y (its value can be zero). Further, 
let Δz = F(y_0 + Δy) − F(y_0). The derivative of the function of a 
function z = F(f(x)) with respect to x, at x = x_0, is evidently equal 
to the limit of the ratio Δz/Δx as Δx → 0, if this limit exists. 
We divide both sides of (3) by Δx: 


$$\frac{\Delta z}{\Delta x} = [F'(y_0) + \alpha] \frac{\Delta y}{\Delta x}.$$

As Δx → 0, Δy also → 0, due to the continuity of function y = f(x) 
at the point x = x_0; and hence, as we have shown above, α → 0. 


The ratio Δy/Δx now tends to the derivative f'(x_0), and on passing 
to the limit in the equation above, we obtain: 


$$\lim_{\Delta x \to 0} \frac{\Delta z}{\Delta x} = F'(y_0) f'(x_0),$$


which proves the theorem. We remark that the continuity of f(x) at 
x = x_0 follows from the assumption of the existence of a derivative 
f'(x_0) [45]. 

This theorem can be put in the following form, as a rule for 
differentiation of functions of a function: the derivative of a function 
of a function is equal to the product of the derivative with respect to the 
intermediate variable and the derivative of the intermediate variable 
with respect to the independent variable: 


$$z'_x = F'(y) f'(x).$$
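
The rule for a function of a function can be compared with the ratio Δz/Δx directly. The choice F(y) = sin y, f(x) = x² and the point x = 0.8 are assumptions of this sketch.

```python
import math

def f(x: float) -> float:
    """Inner function y = f(x) = x^2 (assumed example)."""
    return x * x

def f_prime(x: float) -> float:
    return 2.0 * x

def F(y: float) -> float:
    """Outer function z = F(y) = sin y (assumed example)."""
    return math.sin(y)

def F_prime(y: float) -> float:
    return math.cos(y)

def chain_rule_derivative(x: float) -> float:
    """Product F'(y) f'(x), with y = f(x)."""
    return F_prime(f(x)) * f_prime(x)

def difference_quotient(x: float, dx: float) -> float:
    """Ratio of the increment of z = F(f(x)) to the increment dx (dx != 0)."""
    return (F(f(x + dx)) - F(f(x))) / dx

if __name__ == "__main__":
    x0 = 0.8  # assumed sample point
    for dx in (0.1, 0.01, 0.001, 1e-6):
        print(dx, difference_quotient(x0, dx), chain_rule_derivative(x0))
```

The printed ratio approaches the product of the two derivatives as dx shrinks.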


We pass to the rule for differentiation of inverse functions. If 
y = f(x) is continuous and increasing in the interval (a, b) (i.e. a 
greater value of y corresponds to a greater value of x), with 
A = f(a) and B = f(b), we know, [21] and [44], that a single-valued, 
continuous, and likewise increasing, inverse function x = φ(y) exists 
in the interval (A, B). Since it is increasing, if Δx ≠ 0, Δy ≠ 0, 
and conversely; and due to continuity, Δx → 0 implies Δy → 0, 
and conversely. (The case of a decreasing function is exactly similar.) 

THEOREM. If f(x) has a non-zero derivative f'(x_0) at the point x_0, 
the inverse function φ(y) has a derivative at the point y_0 = f(x_0): 


$$\varphi'(y_0) = \frac{1}{f'(x_0)}. \qquad (4)$$


Denoting corresponding increments of x and y by Δx and Δy, i.e. 

$$\Delta x = \varphi(y_0 + \Delta y) - \varphi(y_0), \qquad \Delta y = f(x_0 + \Delta x) - f(x_0),$$


and noting that both these differ from zero, we can write: 

$$\frac{\Delta x}{\Delta y} = \frac{1}{\dfrac{\Delta y}{\Delta x}}.$$

As we have seen above, Δx and Δy tend simultaneously to zero, 
and the last equation leads to (4) in the limit. The present theorem 
can be put in the form of the following rule for differentiation of 
inverse functions: the derivative of an inverse 
function is equal to unity divided by the derivative 
of the direct function at the corresponding 
point. 
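
The rule for inverse functions can be tried on a function whose inverse has no simple formula. The sketch assumes f(x) = x³ + x, which is increasing, and finds the inverse by bisection; the bracket [−10, 10], the tolerances and the names are choices made only for this illustration.

```python
def f(x: float) -> float:
    """Direct function, increasing for all x (assumed example)."""
    return x ** 3 + x

def f_prime(x: float) -> float:
    return 3.0 * x * x + 1.0

def inverse_by_bisection(y: float, lo: float = -10.0, hi: float = 10.0,
                         tol: float = 1e-13) -> float:
    """Solve f(x) = y on [lo, hi] by repeated halving, using that f increases."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def inverse_derivative_numeric(y: float, k: float = 1e-4) -> float:
    """Symmetric difference quotient of the inverse function at y."""
    return (inverse_by_bisection(y + k) - inverse_by_bisection(y - k)) / (2.0 * k)

if __name__ == "__main__":
    x0 = 1.0
    y0 = f(x0)  # y0 = 2
    # Numerical derivative of the inverse next to unity divided by f'(x0).
    print(inverse_derivative_numeric(y0), 1.0 / f_prime(x0))
```

Here f'(1) = 4, so both printed numbers should be close to 0.25.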

The rule for differentiation of inverse functions 
has a simple geometrical interpretation 
[21]. The functions x = φ(y) and y = f(x) 
have the same graph in the XOY plane, 
the only difference being that the axis of 
the independent variable is OY, and not OX, 
for the function x = φ(y). On drawing the 
tangent MT, and recalling the geometrical significance of the 
derivative, we get: 


$$f'(x) = \tan (OX, MT) = \tan \alpha, \qquad \varphi'(y) = \tan (OY, MT) = \tan \beta,$$


angle β as well as α being reckoned positive, as in Fig. 53. 

Fig. 53 

But evidently, β = π/2 − α, and hence: 


$$\tan \beta = \frac{1}{\tan \alpha}, \quad \text{i.e.} \quad \varphi'(y) = \frac{1}{f'(x)}.$$




If x = φ(y) is the inverse of y = f(x), the converse is evidently 
true, i.e. y = f(x) can be considered the inverse of x = φ(y). 

We use the rule for differentiation of inverse functions for the 
exponential function. 

XIII. y = a^x (a > 0). 

The inverse function in this case will be: 


$$x = \varphi(y) = \log_a y,$$

and by VII: 


$$\varphi'(y) = \frac{1}{y} \cdot \frac{1}{\log a},$$


whence by the rule for differentiation of inverse functions: 


$$y' = \frac{1}{\varphi'(y)} = y \log a, \quad \text{or} \quad (a^x)' = a^x \log a.$$


In the particular case of a = e, we have: 

$$(e^x)' = e^x.$$

The formula obtained, together with the rule for differentiation 
of a function of a function, enables us to calculate the derivative of 
a power function. 

XIV. y = x^n (x > 0; n is any real number). 

This function is defined, and is positive, for all x > 0 [19]. 

Using the definition of logarithm, we can express our function as 
a function of a function: 

$$y = x^n = e^{n \log x}.$$

Using the rule for differentiation of a function of a function, 
we get: 


$$y' = e^{n \log x} \cdot \frac{n}{x} = x^n \cdot \frac{n}{x} = n x^{n-1}.$$

This result can be easily generalized for the case of negative x, 
provided the function exists for such values, as for instance 
y = x^{1/3} = ∛x. 

We use the rule for differentiation of inverse functions for obtain- 
ing the derivatives of the inverse circular functions. 

XV. y = arc sin x. 

We consider the principal value of this function [24], i.e. the arc 
lying in the interval (−π/2, +π/2). We can consider this as the 
inverse of function x = sin y, and in accordance with the rule for 
differentiation of inverse functions, we have: 

$$y'_x = \frac{1}{x'_y} = \frac{1}{\cos y} = \frac{1}{\sqrt{1 - \sin^2 y}} = \frac{1}{\sqrt{1 - x^2}},$$

where the plus sign must be taken with the radical, since cos y has a 
positive sign in the interval (−π/2, +π/2). We can similarly obtain: 

$$(\text{arc cos } x)' = -\frac{1}{\sqrt{1 - x^2}},$$

the principal value of arc cos x being taken, i.e. the arc contained 
in the interval (0, π). 
XVI. y = arc tan x. 
The principal value of arc tan x lies in the interval (−π/2, π/2), 
and we can consider this function as the inverse of x = tan y; hence: 


$$y'_x = \frac{1}{x'_y} = \frac{1}{\dfrac{1}{\cos^2 y}} = \cos^2 y = \frac{1}{1 + \tan^2 y} = \frac{1}{1 + x^2}.$$


We obtain similarly: 

$$(\text{arc cot } x)' = -\frac{1}{1 + x^2}.$$

XVII. We also consider the differentiation of a function of the 
form: 


$$y = u^v,$$


where u and v are functions of x (exponential function). 


We can write: 

$$y = e^{v \log u},$$


and on using the rule for differentiation of a function of a function, 
we obtain: 

$$y' = e^{v \log u} (v \log u)'.$$


Using the rule for differentiation of a product, and differentiating 
log u as a function of a function of x, we finally have: 

$$y' = e^{v \log u} \left( v' \log u + \frac{v}{u} u' \right)$$

or 

$$y' = u^v \left( v' \log u + \frac{v}{u} u' \right).$$

49. Table of derivatives, and examples. A list follows of all the 
rules that we have deduced for differentiation. 
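1. $(c)' = 0$. 

2. $(cu)' = cu'$. 

3. $(u_1 + u_2 + \ldots + u_n)' = u_1' + u_2' + \ldots + u_n'$. 

4. $(u_1 u_2 \ldots u_n)' = u_1' u_2 u_3 \ldots u_n + u_1 u_2' u_3 \ldots u_n + \ldots + u_1 u_2 u_3 \ldots u_n'$. 

5. $\left( \dfrac{u}{v} \right)' = \dfrac{u'v - v'u}{v^2}$. 

6. $(x^n)' = n x^{n-1}$ and $(x)' = 1$. 

7. $(\log_a x)' = \dfrac{1}{x} \cdot \dfrac{1}{\log a}$ and $(\log x)' = \dfrac{1}{x}$. 

8. $(e^x)' = e^x$ and $(a^x)' = a^x \log a$. 

9. $(\sin x)' = \cos x$. 

10. $(\cos x)' = -\sin x$. 

11. $(\tan x)' = \dfrac{1}{\cos^2 x}$. 

12. $(\cot x)' = -\dfrac{1}{\sin^2 x}$. 

13. $(\text{arc sin } x)' = \dfrac{1}{\sqrt{1 - x^2}}$. 

14. $(\text{arc cos } x)' = -\dfrac{1}{\sqrt{1 - x^2}}$. 

15. $(\text{arc tan } x)' = \dfrac{1}{1 + x^2}$. 

16. $(\text{arc cot } x)' = -\dfrac{1}{1 + x^2}$. 

17. $(u^v)' = v u^{v-1} u' + u^v \log u \cdot v'$. 

18. $y'_x = y'_u \cdot u'_x$ (y depends on x through the medium of u). 

19. $x'_y = \dfrac{1}{y'_x}$. 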

We use the above rules to solve a few examples. 

1. y = x³ − 3x² + 7x − 10. 

Using Rules 3, 6, and 2, we obtain y' = 3x² − 6x + 7. 

2. y = 1/∛(x²) = x^{−2/3}. 

Using Rule 6, we obtain: 

$$y' = -\frac{2}{3} x^{-5/3} = -\frac{2}{3x \sqrt[3]{x^2}}.$$

3. y = sin² x. 
We put u = sin x and use Rules 18, 6, and 9: 


$$y' = 2u \cdot u' = 2 \sin x \cos x = \sin 2x.$$


4. y = sin (x²). 
We put u = x² and use the same rules: 


$$y' = \cos u \cdot u' = 2x \cos (x^2).$$

5. y = log (x + √(1 + x²)). 
We first put u = x + √(x² + 1), then v = x² + 1, and use Rules 18 (twice), 
7, 3, and 6: 


$$y' = \frac{u'}{u} = \frac{1}{u} \left( 1 + \frac{v'}{2\sqrt{v}} \right) = \frac{1}{x + \sqrt{x^2 + 1}} \left( 1 + \frac{x}{\sqrt{x^2 + 1}} \right) = \frac{1}{x + \sqrt{x^2 + 1}} \cdot \frac{\sqrt{x^2 + 1} + x}{\sqrt{x^2 + 1}} = \frac{1}{\sqrt{x^2 + 1}}.$$

6. y = (x/(2x + 1))^n. 
We put u = x/(2x + 1) and use Rules 18, 6 and 5: 

$$y' = n u^{n-1} u' = n \left( \frac{x}{2x + 1} \right)^{n-1} \frac{2x + 1 - 2x}{(2x + 1)^2} = \frac{n x^{n-1}}{(2x + 1)^{n+1}}.$$


7. y = x^x. 
Using Rule 17, we obtain: 


$$y' = x \cdot x^{x-1} + x^x \log x = x^x (1 + \log x).$$

8. The function y is given as an implicit function of x by the equation: 

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} - 1 = 0. \qquad (5)$$


The problem is to find the derivative of y. 

If we solved the given equation for y, obtaining y = f(x), the left-hand 
side of the equation would evidently be identical with zero after substituting 
y = f(x). But the derivative of zero is the same as the derivative of a constant 
equal to zero, and hence we must obtain zero on differentiating the left-hand 
side of the given equation with respect to x, y being taken as the function 
of x given by this equation: 

$$\frac{2x}{a^2} + \frac{2y}{b^2} y' = 0, \quad \text{whence} \quad y' = -\frac{b^2 x}{a^2 y}.$$

We see that in this case y' is expressed in terms of y as well as x; but we 
did not need to solve equation (5) for y, i.e. obtain an explicit expression for 
the function, in order to obtain the derivative. 

Equation (5) represents an ellipse, as is known from analytic geometry, 
and the expression obtained for y' gives the slope of the tangent to the 
ellipse at the point with coordinates (x, y). 
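The results of Examples 3, 5 and 7 can be checked numerically. The following is a minimal sketch, not part of the original text: it compares each derivative obtained above with a symmetric difference quotient. The names `central_difference`, `EXAMPLES` and `max_discrepancy`, the step `h` and the sample points are illustrative assumptions; `log` is taken to be the natural logarithm, as in the text.

```python
import math


def central_difference(f, x, h=1e-6):
    """Approximate f'(x) by the symmetric difference quotient (assumed step h)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


# Each entry pairs a function with the derivative obtained in the text.
EXAMPLES = {
    "Example 3: sin^2 x": (
        lambda x: math.sin(x) ** 2,
        lambda x: math.sin(2 * x),
    ),
    "Example 5: log(x + sqrt(x^2 + 1))": (
        lambda x: math.log(x + math.sqrt(x * x + 1)),
        lambda x: 1 / math.sqrt(x * x + 1),
    ),
    "Example 7: x^x": (
        lambda x: x ** x,
        lambda x: x ** x * (1 + math.log(x)),
    ),
}


def max_discrepancy(points=(0.5, 1.0, 2.0)):
    """Largest gap between the stated derivative and the difference quotient."""
    worst = 0.0
    for f, f_prime in EXAMPLES.values():
        for x in points:
            worst = max(worst, abs(central_difference(f, x) - f_prime(x)))
    return worst


if __name__ == "__main__":
    print("largest discrepancy:", max_discrepancy())
```

A small discrepancy at every sample point is consistent with the formulas, although a numerical check of this kind is of course no substitute for the derivation.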


50. The concept of differential. Let $\Delta x$ be the arbitrary increment of the independent variable, which we still take as not depending on $x$. We call it the differential of the independent variable, and denote it by the symbol $\Delta x$ or $dx$. The latter symbol in no circumstances stands for the product of $d$ and $x$, being used only as a symbol for denoting an arbitrary quantity, independent of $x$, which we take as the increment of the independent variable.

The product of the derivative of a function and the differential of the independent variable is called the differential of the function.

The differential of a function is denoted by the symbol $dy$ or $df(x)$:

$$dy \ \text{or} \ df(x) = f'(x)\,dx. \qquad (6)$$

This formula gives us an expression for the derivative in the form of the quotient of the differentials:

$$f'(x) = \frac{dy}{dx}.$$

The differential of a function does not coincide with its increment. To explain the difference between these concepts, we turn to the graph of a function. We take a certain point $M(x, y)$ on the graph, with a second point $N$. We draw the tangent $MT$, the ordinates corresponding to $M$ and $N$, and the line $MP$ parallel to axis $OX$ (Fig. 54). We have:

$$MP = M_1 N_1 = \Delta x \ (\text{or } dx), \qquad PN = \Delta y \ (\text{increment of } y), \qquad \tan \angle PMQ = f'(x),$$

whence

$$dy = f'(x)\,dx = MP \tan \angle PMQ = PQ.$$

The differential of the function, consisting of $PQ$, does not coincide with $PN$, which gives the increment of the function. The segment $PQ$ gives the increment that would be obtained if we replaced segment $MN$ of the curve in the interval $(x, x + dx)$ by the segment $MQ$ of the tangent, i.e. if we took the increment of the function as proportional to the increment of the independent variable in this interval, the coefficient of proportionality being taken as equal to the slope of the tangent $MT$, or, what amounts to the same thing, to the derivative $f'(x)$.

The segment $NQ$ gives the difference between the differential and the increment. We show that, if $\Delta x$ tends to zero, this difference is an infinitesimal of higher order with respect to $\Delta x$ [36].

The ratio $\Delta y / \Delta x$ gives the derivative in the limit, hence [27]:

$$\frac{\Delta y}{\Delta x} = f'(x) + \varepsilon,$$

where $\varepsilon$ is an infinitesimal along with $\Delta x$. We get from this equation:

$$\Delta y = f'(x)\,\Delta x + \varepsilon\,\Delta x$$

or

$$\Delta y = dy + \varepsilon\,dx,$$

whence it is clear that the difference between $dy$ and $\Delta y$ is $(-\varepsilon \Delta x)$. But the ratio of $(-\varepsilon \Delta x)$ to $\Delta x$, equal to $(-\varepsilon)$, tends to zero along with $\Delta x$, i.e. the difference between $dy$ and $\Delta y$ is an infinitesimal of higher order with respect to $\Delta x$. We note that this difference can take either sign. Both $\Delta x$ and the difference take the positive sign in our figure.

Formula (6) gives the rule for finding the differential of a function. 
We use it in some particular cases. 

I. If $c$ is a constant,

$$dc = (c)'\,dx = 0 \cdot dx = 0,$$

i.e. the differential of a constant is zero.

II. $d[cu(x)] = [cu(x)]'\,dx = cu'(x)\,dx = c\,du(x)$,

i.e. a constant factor can be taken outside the differentiation sign.

III. $d[u(x) + v(x) + w(x)] = [u(x) + v(x) + w(x)]'\,dx = [u'(x) + v'(x) + w'(x)]\,dx = u'(x)\,dx + v'(x)\,dx + w'(x)\,dx = du(x) + dv(x) + dw(x)$,

i.e. the differential of a sum is equal to the sum of the differentials of the terms.

IV. $d[u(x)v(x)w(x)] = [u(x)v(x)w(x)]'\,dx = v(x)w(x)u'(x)\,dx + u(x)w(x)v'(x)\,dx + u(x)v(x)w'(x)\,dx = v(x)w(x)\,du(x) + u(x)w(x)\,dv(x) + u(x)v(x)\,dw(x)$,

i.e. the differential of a product is equal to the sum of the products of the differential of each factor with the remaining factors.

We confine ourselves to the case of three factors. The same result is obtained for any finite number of factors.

V. $d\left[\dfrac{u(x)}{v(x)}\right] = \left[\dfrac{u(x)}{v(x)}\right]' dx = \dfrac{v(x)u'(x)\,dx - u(x)v'(x)\,dx}{[v(x)]^2} = \dfrac{v(x)\,du(x) - u(x)\,dv(x)}{[v(x)]^2}$,

i.e. the differential of a quotient (fraction) is equal to the denominator times the differential of the numerator, minus the numerator times the differential of the denominator, all divided by the square of the denominator.

VI. We consider a function of a function, $y = f(u)$, where $u$ is a function of $x$. We find $dy$, assuming $y$ dependent on $x$:

$$dy = y'_x\,dx = f'(u) \cdot u'_x\,dx = f'(u)\,du,$$

i.e. the differential of a function of a function has the same form as would be obtained by treating the auxiliary function as independent variable.

We consider a numerical example, so as to compare the mag- 
nitudes of the increment of a function and its differential. 
We take the function:

$$y = f(x) = x^3 + 2x^2 + 4x + 10$$

and consider its increment:

$$f(2.01) - f(2) = 2.01^3 + 2 \times 2.01^2 + 4 \times 2.01 + 10 - (2^3 + 2 \times 2^2 + 4 \times 2 + 10).$$

On performing all the operations, we obtain the magnitude of the increment:

$$\Delta y = f(2.01) - f(2) = 0.240801.$$

Calculation of the differential is much easier. Here, $dx = 2.01 - 2 = 0.01$, and the differential of the function is:

$$dy = (3x^2 + 4x + 4)\,dx = (3 \times 2^2 + 4 \times 2 + 4) \times 0.01 = 0.24.$$

On comparing $\Delta y$ and $dy$, we see that they coincide to three decimal places.
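The comparison can be repeated for several values of $dx$. The sketch below is an illustration added here, with hypothetical function names; it computes the increment and the differential of the same polynomial at $x = 2$ and shows that their difference, divided by $dx$, shrinks along with $dx$, as the preceding argument requires.

```python
def f(x):
    """The polynomial of the numerical example."""
    return x**3 + 2 * x**2 + 4 * x + 10


def f_prime(x):
    """Its derivative, obtained by Rules 3, 6 and 2."""
    return 3 * x**2 + 4 * x + 4


def increment(x, dx):
    """Delta y = f(x + dx) - f(x)."""
    return f(x + dx) - f(x)


def differential(x, dx):
    """dy = f'(x) dx, formula (6)."""
    return f_prime(x) * dx


if __name__ == "__main__":
    for dx in (0.1, 0.01, 0.001):
        delta_y = increment(2.0, dx)
        dy = differential(2.0, dx)
        # (delta_y - dy) / dx plays the role of epsilon in the text
        print(dx, delta_y, dy, (delta_y - dy) / dx)
```

For this polynomial the difference $\Delta y - dy$ at $x = 2$ equals $8\,dx^2 + dx^3$, so the printed ratio is close to $8\,dx$.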


51. Some differential equations. We have shown that replacing the increment of a function by its differential in the interval $(x, x + dx)$ is equivalent to using the law of direct proportionality between the increment of the function and that of the independent variable, with the corresponding coefficient of proportionality. Furthermore, we know that this substitution involves an error that is an infinitesimal of higher order with respect to $dx$. This is the basis of an application of the analysis of infinitesimals to the study of natural phenomena.

An effort may be made to split up a process into small elements when observing it, so that each element satisfies the law of direct proportionality on account of its smallness. An equation is obtained in the limit, representing the relationship between the independent variable, the function, and their differentials (or derivatives). This is called a differential equation, corresponding to the observed process. The task of finding the function itself from the differential equation is the task of integrating the differential equation.

When applying the analysis of infinitesimals to the study of some law of nature, a differential equation has to be derived for the law in question, then integrated. The latter task is usually far more difficult than the former, and more will be said about it later. We introduce here the differential equations of a few very simple natural phenomena.

1. Barometric formula. The atmospheric pressure $p$ per unit area is evidently a function of the height $h$ above the earth's surface. We take a vertical cylindrical column of air of unit cross-sectional area. We consider two cross-sections, $A$ and $A_1$, at heights $h$ and $h + dh$. On passing from $A$ to $A_1$, the pressure $p$ decreases (if $dh > 0$) by an amount equal to the weight of air contained between $A$ and $A_1$ in the cylinder. If $dh$ is small, we can take the density $\varrho$ of the air as approximately constant in this part of the cylinder. Since the base-area of the small cylinder $AA_1$ is unity, and its height $dh$, its volume is $dh$, and the required weight is $\varrho\,dh$. Thus the decrease in $p$ (for $dh > 0$) is equal to $\varrho\,dh$:

$$dp = -\varrho\,dh.$$

By Boyle's law, the density $\varrho$ is proportional to the pressure $p$:

$$\varrho = cp \quad (c \text{ is a constant}),$$

and we finally obtain the differential equation:

$$dp = -cp\,dh \quad \text{or} \quad \frac{dp}{dh} = -cp.$$

2. Chemical reaction of the first order. Let a certain substance, of mass $a$, take part in a chemical reaction. We let $x$ denote the part of this mass that has already taken part in the reaction at the instant $t$, measured from the start of the reaction. Evidently, $x$ is a function of $t$. It can be taken approximately, for certain reactions, that the amount of substance $dx$, taking part in the reaction in the interval of time from $t$ to $t + dt$, is proportional to $dt$ for small $dt$, as also to the amount of substance that has still not taken part in the reaction at the instant $t$:

$$dx = c(a - x)\,dt \quad \text{or} \quad \frac{dx}{dt} = c(a - x).$$

We transform this differential equation, taking a new function $y = a - x$ instead of $x$, where $y$ denotes the mass that has still not taken part in the reaction by the instant $t$. Noting that $a$ is a constant, we have:

$$\frac{dy}{dt} = -\frac{dx}{dt},$$

so that the differential equation of a chemical reaction of the first order can be rewritten in the form:

$$\frac{dy}{dt} = -cy.$$

3. Law of cooling. We suppose that a certain body, heated to high temperature, is placed in a medium with a constant temperature of 0°. The temperature $\theta$ of the body on cooling will be a function of time $t$, which we measure from the instant of placing the body in the medium. The amount of heat $dQ$ given up by the body in the interval $dt$ can be taken as approximately proportional to the length $dt$ of the interval and to the difference in temperature of the body and the medium at the instant $t$ (Newton's law of cooling). We can then write:

$$dQ = c_1 \theta\,dt \quad (c_1 \text{ is a constant}).$$

Denoting the specific heat of the body by $k$, we have:

$$dQ = -k\,d\theta,$$

where the minus sign is taken because $d\theta$ is negative in the present case (the temperature falls). On comparing these two expressions for $dQ$, we get

$$d\theta = -c\theta\,dt \quad \left(c = \frac{c_1}{k}\right) \quad \text{or} \quad \frac{d\theta}{dt} = -c\theta;$$

$c$ is a constant, if we take the specific heat $k$ as constant. All the differential equations that we have deduced above have the same form. They all express the proportionality of the derivative to the function itself, with a negative coefficient $(-c)$.

We showed in [38] that continuous interest on a basic capital $a$ over $t$ years yields an accumulated capital of $ae^{kt}$, where $k$ is the percentage interest expressed as a fraction; we write:

$$y = ae^{kt}. \qquad (7)$$

We obtain the derivative:

$$y' = ake^{kt} = ky, \qquad (8)$$

i.e. we obtain here the same property of proportionality of the derivative to the function itself, which is why this property is referred to as the law of compound interest. We shall show later that function (7) gives all the solutions of differential equation (8) with any value of the constant $a$, in place of which we shall write $C$.

Thus, the solutions of our equations can be set in the form (replacing $k$ by $-c$):

$$p(h) = Ce^{-ch}, \qquad y(t) = Ce^{-ct}, \qquad \theta(t) = Ce^{-ct}, \qquad (9)$$

where $C$ is a constant. We now indicate the physical significance of constant $C$ in each of the above formulae. Putting $h = 0$ in the first formula, we get:

$$C = p(0) = p_0,$$

where $p_0$ is the atmospheric pressure at $h = 0$, i.e. at the earth's surface. The second formula gives us at $t = 0$:

$$C = y(0),$$

i.e. $C$ is the mass that has not taken part in the reaction at the initial instant, which was earlier denoted by $a$. Finally, putting $t = 0$ in the third of the formulae (9), we see similarly that $C$ is the initial temperature $\theta_0$ of the body at the moment of placing it in the medium. We thus have these results:

$$p(h) = p_0 e^{-ch}, \qquad y(t) = ae^{-ct}, \qquad \theta(t) = \theta_0 e^{-ct}. \qquad (10)$$
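The passage from small elements to the exponential law can be illustrated numerically. The following is a rough sketch under stated assumptions, not a method given in the text: the interval is split into equal steps, the increment on each step is replaced by the differential $dy = -cy\,dt$ (this is Euler's method), and the result is compared with $Ce^{-ct}$. The function names and the numerical values of $C$, $c$ and $t$ are illustrative.

```python
import math


def euler_decay(y0, c, t_end, steps):
    """Integrate dy/dt = -c*y from 0 to t_end, replacing each small
    increment of y by its differential dy = -c*y*dt."""
    dt = t_end / steps
    y = y0
    for _ in range(steps):
        y += -c * y * dt
    return y


def exact_decay(y0, c, t):
    """The solution y(t) = C e^{-ct} with C = y(0)."""
    return y0 * math.exp(-c * t)


if __name__ == "__main__":
    for steps in (10, 100, 1000, 10000):
        approx = euler_decay(1.0, 0.5, 2.0, steps)
        print(steps, approx, abs(approx - exact_decay(1.0, 0.5, 2.0)))
```

The printed error decreases as the number of steps grows, which agrees with the statement that the error of each small element is of higher order than $dt$.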


52. Estimation of errors. When a magnitude $x$ is found in practice or is roughly calculated, an error $\Delta x$ is obtained, called the absolute error of the observation or calculation. By itself it does not characterize the accuracy of the observation. For instance, an error of about 1 cm in giving the length of a room would be permissible in practice, whereas the same error in determining the distance between two nearby objects (say the source and screen of a photometer) would point to great inaccuracy in the measurement. Hence follows the further concept of relative error, this being equal to the absolute value $|\Delta x / x|$ of the ratio of the absolute error to the measured magnitude.

We now suppose that a certain magnitude $y$ is defined by the equation $y = f(x)$. An error $\Delta x$ in defining $x$ leads to a corresponding error $\Delta y$. For small values of $\Delta x$, $\Delta y$ can be approximately replaced by the differential $dy$, so that the relative error in defining magnitude $y$ is given by

$$\left|\frac{dy}{y}\right|.$$




Examples. 1. The current $i$ is given with a tangent galvanometer by the well-known formula:

$$i = c \tan \varphi.$$

Let $d\varphi$ be the error in reading angle $\varphi$:

$$di = \frac{c}{\cos^2 \varphi}\,d\varphi, \qquad \frac{di}{i} = \frac{c}{\cos^2 \varphi \cdot c \tan \varphi}\,d\varphi = \frac{2}{\sin 2\varphi}\,d\varphi,$$

whence it is clear that the relative error $|di/i|$ in determining $i$ will be smaller, the nearer $\varphi$ is to 45°.


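2. We take the product $uv$:

$$d(uv) = v\,du + u\,dv, \qquad \frac{d(uv)}{uv} = \frac{du}{u} + \frac{dv}{v},$$

whence it follows:

$$\left|\frac{d(uv)}{uv}\right| \le \left|\frac{du}{u}\right| + \left|\frac{dv}{v}\right|,$$

i.e. the relative error of a product is not greater than the sum of the relative errors in the factors.

The same rule is obtained for a quotient, since:

$$d\left(\frac{u}{v}\right) = \frac{v\,du - u\,dv}{v^2}, \qquad \frac{d\left(\dfrac{u}{v}\right)}{\dfrac{u}{v}} = \frac{du}{u} - \frac{dv}{v}, \qquad \left|\frac{d\left(\dfrac{u}{v}\right)}{\dfrac{u}{v}}\right| \le \left|\frac{du}{u}\right| + \left|\frac{dv}{v}\right|.$$

3. We consider the formula for the area of a circle:

$$Q = \pi r^2, \qquad dQ = 2\pi r\,dr, \qquad \frac{dQ}{Q} = \frac{2\pi r\,dr}{\pi r^2} = 2\,\frac{dr}{r},$$

i.e. the relative error in determining the area of a circle is by the above formula equal to twice the relative error in determining the radius.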
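Both rules can be tried on actual numbers. The sketch below is illustrative only; the function names and the measured values are assumptions. It uses true increments rather than differentials, so the rules hold only up to terms of higher order in the errors.

```python
import math


def relative_error(exact, measured):
    """|delta x / x| for a measured value of an exact magnitude."""
    return abs((measured - exact) / exact)


def circle_area(r):
    """Q = pi r^2."""
    return math.pi * r * r


if __name__ == "__main__":
    # Example 3: the radius is measured as 10.01 instead of 10.
    r, dr = 10.0, 0.01
    print("radius:", relative_error(r, r + dr))
    print("area:  ", relative_error(circle_area(r), circle_area(r + dr)))

    # Example 2: errors of opposite sign in the two factors.
    u, du, v, dv = 3.0, 0.003, 5.0, -0.01
    print("product:", relative_error(u * v, (u + du) * (v + dv)))
    print("bound:  ", relative_error(u, u + du) + relative_error(v, v + dv))
```

With these values the relative error of the area is very nearly twice that of the radius, and the relative error of the product stays below the sum of the relative errors of the factors.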

4. We suppose that angle $\varphi$ is defined by the logarithms of its sine and tangent. We have by the rules of differentiation:

$$d(\log_{10} \sin \varphi) = \frac{\cos \varphi\,d\varphi}{\log 10 \times \sin \varphi}, \qquad d(\log_{10} \tan \varphi) = \frac{d\varphi}{\log 10 \times \tan \varphi \times \cos^2 \varphi},$$

whence

$$d\varphi = \frac{\log 10 \times \sin \varphi}{\cos \varphi}\,d(\log_{10} \sin \varphi), \qquad d\varphi = \log 10 \times \sin \varphi \cos \varphi\,d(\log_{10} \tan \varphi). \qquad (11)$$

We suppose that we make the same error in finding $\log_{10} \sin \varphi$ and $\log_{10} \tan \varphi$ (this error depends on the number of decimal places in the logarithmic tables that we use). The first of formulae (11) gives a value of $d\varphi$ of greater absolute value than that obtained with the second of formulae (11), since the product $\log 10 \times \sin \varphi$ is divided in the first case, and multiplied in the second case, by $\cos \varphi$, and $|\cos \varphi| < 1$. It is thus better to calculate the angles using the table for $\log_{10} \tan \varphi$.


§ 4. Derivatives and differentials
of higher orders


53. Derivatives of higher orders. The derivative $f'(x)$ of the function $y = f(x)$ is also a function of $x$, as we know. On differentiating it, we obtain a new function, called the second derivative or the derivative of the second order of the original function $f(x)$, and denoted by:

$$y'' \ \text{or} \ f''(x).$$

On differentiating the second derivative, we obtain the derivative of the third order, or simply, the third derivative:

$$y''' \ \text{or} \ f'''(x).$$

Using the operation of differentiation in this way, we obtain the derivative of any order $n$, $y^{(n)}$ or $f^{(n)}(x)$.


We consider some examples.

1. $y = e^{ax}$, $y' = ae^{ax}$, $y'' = a^2 e^{ax}$, ..., $y^{(n)} = a^n e^{ax}$.

2. $y = (ax + b)^k$, $y' = ak(ax + b)^{k-1}$, $y'' = a^2 k(k - 1)(ax + b)^{k-2}$, ..., $y^{(n)} = a^n k(k - 1)(k - 2) \dots (k - n + 1)(ax + b)^{k-n}$.

3. We know that:

$$(\sin x)' = \cos x = \sin\left(x + \frac{\pi}{2}\right), \qquad (\cos x)' = -\sin x = \cos\left(x + \frac{\pi}{2}\right),$$

i.e. differentiation of $\sin x$ and $\cos x$ amounts to increasing the argument by $\pi/2$, and hence:

$$(\sin x)'' = \left[\sin\left(x + \frac{\pi}{2}\right)\right]' = \sin\left(x + 2 \cdot \frac{\pi}{2}\right), \qquad (\sin x)''' = \left[\sin\left(x + 2 \cdot \frac{\pi}{2}\right)\right]' = \sin\left(x + 3 \cdot \frac{\pi}{2}\right),$$

and in general:

$$(\sin x)^{(n)} = \sin\left(x + n\,\frac{\pi}{2}\right) \quad \text{and} \quad (\cos x)^{(n)} = \cos\left(x + n\,\frac{\pi}{2}\right).$$

4. $y = \log(1 + x)$, $y' = \dfrac{1}{1 + x}$, $y'' = -\dfrac{1}{(1 + x)^2}$, $y''' = \dfrac{1 \cdot 2}{(1 + x)^3}$, ..., $y^{(n)} = (-1)^{n-1} \dfrac{(n - 1)!}{(1 + x)^n}$.

5. We consider the sum of functions:

$$y = u + v + w.$$

Using the rule for differentiation of a sum, and assuming that the corresponding derivatives of functions $u$, $v$ and $w$ exist, we have:

$$y' = u' + v' + w', \quad y'' = u'' + v'' + w'', \quad \dots, \quad y^{(n)} = u^{(n)} + v^{(n)} + w^{(n)},$$

i.e. the derivative of any order of a sum is equal to the sum of the derivatives of the same order. For example:

$$y = x^3 - 4x^2 + 7x + 10; \quad y' = 3x^2 - 8x + 7; \quad y'' = 6x - 8; \quad y''' = 6;$$

$y^{(4)} = 0$ and generally, $y^{(n)} = 0$ for $n > 3$.

It can be shown in the same way that, in general, the $n$-th derivative of a polynomial of degree $m$ is zero if $n > m$.

We now consider the derivatives of the product of two functions $u$, $v$, viz., $y = uv$. We use the rules for differentiation of a product and a sum:

$$y' = u'v + uv', \qquad y'' = u''v + u'v' + u'v' + uv'' = u''v + 2u'v' + uv'',$$

$$y''' = u'''v + u''v' + 2u''v' + 2u'v'' + u'v'' + uv''' = u'''v + 3u''v' + 3u'v'' + uv'''.$$

We note the following rule: in order to obtain the $n$-th derivative of the product $uv$, $(u + v)^n$ should be determined by Newton's binomial formula, and the exponents of powers of $u$ and $v$ in the result should be taken to indicate the orders of derivatives, the zero powers ($u^0 = v^0 = 1$) appearing in the extreme terms of the result being taken as the functions themselves.

This rule is known as Leibniz's theorem, and is written symbolically as:

$$y^{(n)} = (u + v)^{(n)}.$$

We prove this theorem by induction. We suppose that the rule is true for the $n$th derivative, i.e.:

$$y^{(n)} = (u + v)^{(n)} = u^{(n)} v + \frac{n}{1}\,u^{(n-1)} v' + \frac{n(n - 1)}{1 \cdot 2}\,u^{(n-2)} v'' + \dots + \frac{n(n - 1) \dots (n - k + 1)}{k!}\,u^{(n-k)} v^{(k)} + \dots + u v^{(n)}. \qquad (1)$$

To obtain $y^{(n+1)}$, we differentiate the above sum with respect to $x$. The general term $u^{(n-k)} v^{(k)}$ of the sum gives, by the rule for differentiation of a product, the derivative $u^{(n-k+1)} v^{(k)} + u^{(n-k)} v^{(k+1)}$. But this latter sum can be written symbolically as:

$$u^{n-k} v^k (u + v).$$

In fact, we have regarded the orders of the derivatives as indices and we have factorized the expression

$$u^{n-k+1} v^k + u^{n-k} v^{k+1}.$$

Hence we see that $y^{(n+1)}$ must be obtained by multiplying each term of the sum (1), and hence all the sum, symbolically by $(u + v)$, so that:

$$y^{(n+1)} = (u + v)^{(n)} \times (u + v) = (u + v)^{(n+1)}.$$

We have shown that if Leibniz's theorem is true for any given $n$, it is true for $(n + 1)$. But we have shown directly that it is true for $n = 1, 2, 3$; and hence it is true for all $n$.

We take as an example:

$$y = e^x (3x^2 - 1)$$

and find $y^{(100)}$:

$$y^{(100)} = (e^x)^{(100)} (3x^2 - 1) + \frac{100}{1}\,(e^x)^{(99)} (3x^2 - 1)' + \frac{100 \cdot 99}{1 \cdot 2}\,(e^x)^{(98)} (3x^2 - 1)'' + \dots + e^x (3x^2 - 1)^{(100)}.$$

All the derivatives of a polynomial of the second degree are zero as from the third order, and $(e^x)^{(n)} = e^x$, whence:

$$y^{(100)} = e^x (3x^2 - 1) + 100\,e^x \times 6x + 4950\,e^x \times 6 = e^x (3x^2 + 600x + 29{,}699).$$
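The arithmetic of this example can be reproduced mechanically. The sketch below is an added illustration with hypothetical function names: it applies Leibniz's theorem to a product $e^x p(x)$, where $p$ is a polynomial stored as a list of coefficients (constant term first), and uses the fact that every derivative of $e^x$ is $e^x$.

```python
from math import comb


def poly_derivative(coeffs):
    """Derivative of c0 + c1*x + c2*x^2 + ... given as [c0, c1, c2, ...]."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def exp_times_poly_nth_derivative(coeffs, n):
    """Return q with (e^x p(x))^(n) = e^x q(x).

    By Leibniz's theorem q = sum over k of C(n, k) * p^(k),
    since (e^x)^(n-k) = e^x for every k.
    """
    result = [0] * len(coeffs)
    derivative = list(coeffs)
    k = 0
    while derivative and k <= n:
        weight = comb(n, k)  # binomial coefficient n(n-1)...(n-k+1)/k!
        for i, c in enumerate(derivative):
            result[i] += weight * c
        derivative = poly_derivative(derivative)
        k += 1
    return result


if __name__ == "__main__":
    # p(x) = 3x^2 - 1, n = 100
    print(exp_times_poly_nth_derivative([-1, 0, 3], 100))
```

For $p(x) = 3x^2 - 1$ and $n = 100$ the list returned is `[29699, 600, 3]`, i.e. $3x^2 + 600x + 29{,}699$, in agreement with the result above.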


54. Mechanical significance of the second derivative. We consider the motion of a point on a straight line:

$$s = f(t),$$

where, as usual, $t$ denotes time, and $s$ is the path, measured from some fixed point of the line. We obtain the velocity of the motion by differentiating once with respect to $t$:

$$v = f'(t).$$

We obtain the second derivative as the limit of the ratio $\Delta v / \Delta t$ as $\Delta t$ tends to zero. This ratio characterizes the rate of change of velocity in the interval $\Delta t$, and gives the average acceleration in this interval, whilst the limit of the ratio as $\Delta t \to 0$ gives the acceleration $w$ of the observed motion at time $t$:

$$w = f''(t).$$

We take $f(t)$ as a polynomial of the second degree:

$$s = at^2 + bt + c, \qquad v = 2at + b, \qquad w = 2a,$$

i.e. the acceleration $w$ is constant, and the coefficient $a = w/2$. Setting $t = 0$, we get $b = v_0$, i.e. the coefficient $b$ is equal to the initial velocity, and $c = s_0$, i.e. $c$ is equal to the distance of the point at $t = 0$ from the origin on the line. On putting these values for $a$, $b$, and $c$ in the expression for $s$, we get the formula for the path with uniformly accelerated ($w > 0$) or uniformly decelerated ($w < 0$) motion:

$$s = \frac{1}{2} w t^2 + v_0 t + s_0.$$

In general, on knowing the law for the change of path, we can find the acceleration $w$ by differentiating twice with respect to $t$, and hence find the force $f$ producing the motion, since, by Newton's second law, $f = mw$, where $m$ is the mass of the moving point.

All the above applies only to linear motion. In the case of curvilinear motion, as is shown in mechanics, $f''(t)$ gives only the projection of the acceleration vector on the tangent to the trajectory.

We take the example of a point $M$ oscillating harmonically on a line, so that its distance $s$ from a fixed point $O$ of the line is defined by the formula:

$$s = a \sin\left(\frac{2\pi}{\tau}\,t + \omega\right),$$

where the amplitude $a$, the period of oscillation $\tau$, and the phase $\omega$ are constants. Differentiations give us the velocity $v$ and the force $f$:

$$v = \frac{2\pi a}{\tau} \cos\left(\frac{2\pi}{\tau}\,t + \omega\right), \qquad f = mw = -\frac{4\pi^2 m}{\tau^2}\,a \sin\left(\frac{2\pi}{\tau}\,t + \omega\right) = -\frac{4\pi^2 m}{\tau^2}\,s,$$

i.e. the force is proportional in magnitude to the length of the interval $OM$ and acts in the opposite direction. In other words, the force is always directed from the point $M$ to the point $O$, being proportional to their distance apart.


55. Differentials of higher orders. We now introduce the concept of higher order differentials of a function $y = f(x)$. Its differential

$$dy = f'(x)\,dx$$

is clearly a function of $x$, though it must be remembered that the differential $dx$ of the independent variable is reckoned as independent of $x$ [50], so that it must be taken outside the differentiation sign as a constant factor on further differentiation. The differential of $dy$ can be obtained by treating it as a function of $x$; this is called the second order differential of the original function $f(x)$ and is denoted by $d^2 y$ or $d^2 f(x)$:

$$d^2 y = d(dy) = [f'(x)\,dx]'\,dx = f''(x)\,dx^2.$$

On obtaining the differential of this further function of $x$, we arrive at the third order differential:

$$d^3 y = d(d^2 y) = [f''(x)\,dx^2]'\,dx = f'''(x)\,dx^3,$$

and in general, we arrive by successive differentiation at the concept of the $n$th order differential of function $f(x)$, which is expressed as:

$$d^n f(x) \ \text{or} \ d^n y = f^{(n)}(x)\,dx^n. \qquad (2)$$

This formula allows of the expression of the $n$th derivative as a fraction:

$$f^{(n)}(x) = \frac{d^n y}{dx^n}. \qquad (3)$$

We now consider a function of a function, $y = f(u)$, where $u$ is a function of some independent variable. We know [50] that the first differential of this function has the same form as when $u$ is the independent variable:

$$dy = f'(u)\,du.$$

Formula (2) is no longer valid for obtaining higher order differentials, since we are not justified in treating $du$ as a constant when $u$ is not the independent variable. For example, we obtain the second differential by using the rule for finding the differential of a product, with the result:

$$d^2 y = d[f'(u)\,du] = du\,d[f'(u)] + f'(u)\,d(du) = f''(u)\,du^2 + f'(u)\,d^2 u,$$

which has the extra term $f'(u)\,d^2 u$, compared with formula (2).

If $u$ is the independent variable, $du$ must be treated as constant and $d^2 u = 0$. We now take $u$ as a linear function of the independent variable $t$, i.e.,

$$u = at + b.$$

Now, $du = a\,dt$, i.e., $du$ is again a constant, so that the higher order differentials of the function of a function are given by (2):

$$d^n f(u) = f^{(n)}(u)\,du^n,$$

i.e. formula (2) for the higher order differentials is valid when $x$ is either the independent variable, or a linear function of the independent variable.


56. Finite differences of functions. We denote the increment of the independent variable by $h$. The corresponding increment of $y = f(x)$ will be:

$$\Delta y = f(x + h) - f(x). \qquad (4)$$

An alternative name is the first-order difference of function $f(x)$. This difference is, for its part, also a function of $x$, and we can find its difference by subtracting from its value at $(x + h)$ its value at $x$. This new difference is called the second-order difference of the original function $f(x)$ and is denoted by $\Delta^2 y$. We can easily express $\Delta^2 y$ in terms of values of $f(x)$:

$$\Delta^2 y = [f(x + 2h) - f(x + h)] - [f(x + h) - f(x)] = f(x + 2h) - 2f(x + h) + f(x). \qquad (5)$$

This second order difference is also a function of $x$, and the difference of this function can be defined, giving the third-order difference of the original function $f(x)$, denoted by $\Delta^3 y$. Replacing $x$ by $(x + h)$ on the right of (5), and subtracting the right of (5) from the result, we obtain the expression for $\Delta^3 y$:

$$\Delta^3 y = [f(x + 3h) - 2f(x + 2h) + f(x + h)] - [f(x + 2h) - 2f(x + h) + f(x)] = f(x + 3h) - 3f(x + 2h) + 3f(x + h) - f(x).$$

We can thus go on to define the difference of any order, the $n$th order difference $\Delta^n y$ being expressed as follows in terms of values of $f(x)$:

$$\Delta^n y = f(x + nh) - \frac{n}{1}\,f(x + (n - 1)h) + \frac{n(n - 1)}{2!}\,f(x + (n - 2)h) - \dots + (-1)^k\,\frac{n(n - 1) \dots (n - k + 1)}{k!}\,f(x + (n - k)h) + \dots + (-1)^n f(x). \qquad (6)$$

We have seen above that this formula is true for $n = 1$, 2, and 3. A rigorous proof requires us to pass from $n$ to $(n + 1)$ in the usual way. We note that $(n + 1)$ values of $f(x)$, for the values of the argument $x, x + h, x + 2h, \dots, x + nh$, are needed for calculating $\Delta^n y$. These values of the argument form an arithmetical progression with difference $h$, or as we say, represent equidistant values.

For small $h$, $\Delta y$ differs little from the differential $dy$. Similarly, higher-order differences give approximate values of the differentials of corresponding order, and conversely. Whilst we lack an analytic expression for a function, we may be given a table for it for equidistant values of the argument; we cannot then calculate the various derivatives of the function accurately, but we can obtain approximate values by calculating the ratio $\Delta^n y / \Delta x^n$, instead of using the accurate formula (3). As an example, we give a table of differences and differentials of the function $y = x^3$ in the interval (2, 3), taking:

$$\Delta x = h = 0.1.$$

In setting up this table, the successive values of $y = x^3$ were calculated, then the values of $\Delta y$ obtained from these, by subtraction in accordance with formula (4), then the values of $\Delta^2 y$ obtained from the values of $\Delta y$ by further subtractions, and so on. This method of calculating the differences successively is naturally simpler than using formula (6). The differentials were calculated in the ordinary way, the formulae being given at the top of the table, where we must take $dx = h = 0.1$.

We compare the accurate and approximate values of the second derivative $y''$ for $x = 2$. Here, $y'' = 6x$ and $y'' = 12$ with $x = 2$. The approximate value is given by the ratio $\Delta^2 y / h^2$, and we have for $x = 2$:

$$\frac{0.126}{(0.1)^2} = 12.6.$$

$x$ | $y$ | $\Delta y$ | $\Delta^2 y$ | $\Delta^3 y$ | $\Delta^4 y$ | $dy = 3x^2\,dx$ | $d^2 y = 6x\,dx^2$ | $d^3 y = 6\,dx^3$ | $d^4 y = 0$
2 | 8.000 | 1.261 | 0.126 | 0.006 | 0 | 1.200 | 0.120 | 0.006 | 0
2.1 | 9.261 | 1.387 | 0.132 | 0.006 | 0 | 1.323 | 0.126 | 0.006 | 0
2.2 | 10.648 | 1.519 | 0.138 | 0.006 | 0 | 1.452 | 0.132 | 0.006 | 0
2.3 | 12.167 | 1.657 | 0.144 | 0.006 | 0 | 1.587 | 0.138 | 0.006 | 0
2.4 | 13.824 | 1.801 | 0.150 | 0.006 | 0 | 1.728 | 0.144 | 0.006 | 0
2.5 | 15.625 | 1.951 | 0.156 | 0.006 | 0 | 1.875 | 0.150 | 0.006 | 0
2.6 | 17.576 | 2.107 | 0.162 | 0.006 | 0 | 2.028 | 0.156 | 0.006 | 0
2.7 | 19.683 | 2.269 | 0.168 | 0.006 | - | 2.187 | 0.162 | 0.006 | -
2.8 | 21.952 | 2.437 | 0.174 | - | - | 2.352 | 0.168 | - | -
2.9 | 24.389 | 2.611 | - | - | - | 2.523 | - | - | -
3 | 27.000 | - | - | - | - | - | - | - | -

If $f(x)$ is a polynomial of $x$:

$$y = f(x) = a_0 x^m + a_1 x^{m-1} + \dots + a_{m-1} x + a_m,$$

it is easily seen that, on calculating $\Delta y$ by (4), we get a polynomial of degree $(m - 1)$ for $\Delta y$, with highest term $m a_0 h x^{m-1}$. Thus, with $y = x^3$, $\Delta y$ is a polynomial of second degree in $x$, $\Delta^2 y$ is a polynomial of first degree, $\Delta^3 y$ is constant, and $\Delta^4 y$ is zero (see table). It is suggested as an exercise to the reader to show that the values of $d^2 y$ must be one step behind those of $\Delta^2 y$ in our example, as is obvious from the table.
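The successive subtractions used to build the table are easy to mechanize. The following is a minimal sketch with hypothetical names; exact fractions are used instead of floating-point numbers so that the differences come out as exact decimals, which is a choice made here and not something prescribed by the text.

```python
from fractions import Fraction


def difference_table(values, max_order):
    """Return [values, first differences, second differences, ...],
    each row obtained from the previous one by subtraction, as in (4)."""
    table = [list(values)]
    for _ in range(max_order):
        previous = table[-1]
        table.append([b - a for a, b in zip(previous, previous[1:])])
    return table


H = Fraction(1, 10)                                 # step h = 0.1
XS = [Fraction(20 + i, 10) for i in range(11)]      # 2, 2.1, ..., 3
TABLE = difference_table([x**3 for x in XS], 4)     # y = x^3

if __name__ == "__main__":
    for order, row in enumerate(TABLE):
        print(order, [float(value) for value in row])
    # approximate second derivative at x = 2, to compare with y'' = 12
    print(float(TABLE[2][0] / H**2))
```

The rows reproduce the columns $y$, $\Delta y$, $\Delta^2 y$, $\Delta^3 y$, $\Delta^4 y$ of the table, and the last line gives the ratio $\Delta^2 y / h^2$ at $x = 2$.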


§ 5. Application of derivatives
to the study of functions


57. Tests for increasing and decreasing functions. Knowledge of the derivative enables us to study the various properties of a function. We begin with the simplest and most basic question, of whether a function is increasing or decreasing.

The function $f(x)$ is said to be increasing in an interval if it increases correspondingly with the variable throughout that interval, i.e. if

$$f(x + h) - f(x) > 0 \quad \text{for} \quad h > 0.$$

On the other hand, if

$$f(x + h) - f(x) < 0 \quad \text{for} \quad h > 0,$$

the function is said to be decreasing.

Turning to the graph of the function, the interval in which the function is increasing corresponds to the section of the graph for which greater values of $x$ imply greater values of $y$. If $OX$ is directed to the right, and $OY$ upwards, as in Fig. 55, the interval of increase of the function will correspond to the part of the graph where movement along it to the right in the direction of increasing abscissa implies movement upwards. An interval of decrease, on the other hand, corresponds to a part of the curve where we move downwards on moving along the curve to the right. In Fig. 55, the interval of increase corresponds to section $AB$ of the graph, and the interval of decrease to $BC$. It is immediately clear from the figure that in the first section, the tangent forms with the direction of $OX$ an angle $\alpha$ measured from $OX$ to the tangent, the tangent of which is positive; but the tangent of this angle is also the first derivative $f'(x)$. In section $BC$, on the contrary, the direction of the tangent forms with the direction of $OX$ an angle $\alpha$ (in the fourth quadrant), the tangent of which is negative, i.e. $f'(x)$ is negative in this case. Putting these results together, we arrive at the following rule: a function is increasing in the intervals for which $f'(x)$ is positive, and is decreasing in intervals where $f'(x) < 0$.

We have arrived at this rule by using the figure. A rigorous analytic proof is given later. We now make use of the rule in some examples.





Fig. 55

1. We prove the inequality:

$$\sin x > x - \frac{x^3}{6} \quad \text{for} \quad x > 0.$$

For this, we form the difference:

$$f(x) = \sin x - \left(x - \frac{x^3}{6}\right).$$

We find the derivative $f'(x)$:

$$f'(x) = \cos x - 1 + \frac{x^2}{2} = \frac{x^2}{2} - (1 - \cos x) = \frac{x^2}{2} - 2 \sin^2 \frac{x}{2} = 2\left[\left(\frac{x}{2}\right)^2 - \left(\sin \frac{x}{2}\right)^2\right].$$

Since the absolute value of the arc is greater than its sine, we can say that $f'(x) > 0$ in the interval $(0, +\infty)$, i.e. $f(x)$ is increasing in this interval; and $f(0) = 0$, so that:

$$f(x) = \sin x - \left(x - \frac{x^3}{6}\right) > 0 \quad \text{for} \quad x > 0,$$

i.e.

$$\sin x > x - \frac{x^3}{6} \quad \text{for} \quad x > 0.$$


2. It can be shown in exactly the same way that:

$$x > \log(1 + x) \quad \text{for} \quad x > 0.$$

We form the difference:

$$f(x) = x - \log(1 + x),$$

whence

$$f'(x) = 1 - \frac{1}{1 + x} = \frac{x}{1 + x}.$$

It is evident from this expression that $f'(x) > 0$ for $x > 0$, i.e. $f(x)$ is increasing in the interval $(0, +\infty)$; but $f(0) = 0$, so that:

$$f(x) = x - \log(1 + x) > 0 \quad \text{for} \quad x > 0,$$

i.e.

$$x > \log(1 + x) \quad \text{for} \quad x > 0.$$


3. We take Kepler's equation, discussed in [31]:

$$x = q \sin x + a \quad (0 < q < 1).$$

We can write this in the form:

$$f(x) = x - q \sin x - a = 0.$$

We get for the derivative:

$$f'(x) = 1 - q \cos x.$$

We note that the absolute value of $q \cos x$ is less than unity, since $q$ lies between zero and unity by hypothesis; we can thus say that $f'(x) > 0$ for any $x$, so that $f(x)$ is increasing in the interval $(-\infty, +\infty)$ and can therefore be zero only once, i.e. Kepler's equation cannot have more than one real root.

If the constant $a$ is a multiple of $\pi$, i.e. $a = k\pi$, where $k$ is an integer, we obtain $f(k\pi) = 0$ on substituting $x = k\pi$ directly, so that $x = k\pi$ is the unique root of Kepler's equation. If $a$ is not a multiple of $\pi$, we can find an integer $k$ such that

$$k\pi < a < (k + 1)\pi.$$

Substituting $x = k\pi$ and $(k + 1)\pi$, we get:

$$f(k\pi) = k\pi - a < 0, \qquad f\big((k + 1)\pi\big) = (k + 1)\pi - a > 0.$$

But if $f(k\pi)$ and $f\big((k + 1)\pi\big)$ have different signs, $f(x)$ must be zero in the interval $\big(k\pi, (k + 1)\pi\big)$ [35], i.e. the single root of Kepler's equation lies in this interval.
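Once the root is known to lie between $k\pi$ and $(k + 1)\pi$, it can be located numerically. The sketch below uses repeated halving of the interval (bisection), which is a standard technique chosen here for illustration and not a method described in this section; the function name and the fixed number of halvings are assumptions.

```python
import math


def kepler_root(q, a, halvings=200):
    """Approximate the single root of x = q*sin(x) + a, with 0 < q < 1.

    The root is bracketed by k*pi and (k + 1)*pi, where k*pi <= a < (k + 1)*pi,
    and f(x) = x - q*sin(x) - a is increasing, so the sign of f at the
    midpoint tells us which half of the interval contains the root.
    """
    def f(x):
        return x - q * math.sin(x) - a

    k = math.floor(a / math.pi)
    lo, hi = k * math.pi, (k + 1) * math.pi
    for _ in range(halvings):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    x = kepler_root(0.5, 1.0)
    print(x, x - 0.5 * math.sin(x) - 1.0)
```

The second printed number is the residual $f(x)$ at the computed root, which should be close to zero.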


4. We take the equation:
$$f(x) = 3x^5 - 25x^3 + 60x + 15 = 0.$$
We find the derivative f'(x) and set it equal to zero:
$$f'(x) = 15x^4 - 75x^2 + 60 = 15(x^4 - 5x^2 + 4) = 0.$$
We solve this biquadratic equation and find that f'(x) is zero for
$$x = -2,\ -1,\ +1,\ \text{and } +2.$$
We can now divide the total interval $(-\infty, +\infty)$ into five intervals:
$$(-\infty, -2),\quad (-2, -1),\quad (-1, 1),\quad (1, 2),\quad (2, +\infty),$$

so that f'(x) has the same sign in each, f(x) being therefore monotonic, i.e.
either increasing or decreasing, and hence having not more than one root in
each interval. If f(x) has different signs at the ends of a given interval,
f(x) = 0 has one root in the interval; whereas if it has the same sign, there
is no root in the interval. Thus, by finding the signs of f(x) at the ends of the
five intervals above, we can find the number of roots of the equation.

To find the signs of f(x) at $x = \pm\infty$, we write f(x) in the form:

$$f(x) = x^5 \left(3 - \frac{25}{x^2} + \frac{60}{x^4} + \frac{15}{x^5}\right).$$

For $x \to -\infty$, $x^5 \to -\infty$, and the expression in brackets tends to 3, so
$f(x) \to -\infty$. Similarly it can be seen that $f(x) \to +\infty$ for $x \to +\infty$. On
substituting the values x = -2, -1, 1, and 2, we get the following table:

  x    | $-\infty$ | -2 | -1 |  1 |  2 | $+\infty$
  f(x) |     -     | -  | -  | +  | +  |     +




















We see that f(x) has different signs only at the ends of the interval (-1, 1),
so that the equation concerned has only one real root, which lies in this interval.
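
The sign table can be checked mechanically. The short Python sketch below evaluates f(x) at the zeros of f'(x) and at two large abscissae standing in for $\pm\infty$ (the value `BIG` is an arbitrary choice of ours), then lists the intervals of monotonicity at whose ends the sign changes.

```python
def f(x):
    return 3 * x**5 - 25 * x**3 + 60 * x + 15

critical_points = [-2, -1, 1, 2]      # zeros of f'(x) = 15(x^4 - 5x^2 + 4)
BIG = 1e3                             # assumed stand-in for "infinity"
nodes = [-BIG] + critical_points + [BIG]

# Sign of f at each node: -1 or +1.
signs = [1 if f(x) > 0 else -1 for x in nodes]

# f is monotonic between consecutive nodes, so each sign change marks one root.
sign_changes = [
    (nodes[i], nodes[i + 1])
    for i in range(len(nodes) - 1)
    if signs[i] != signs[i + 1]
]

print(signs)          # [-1, -1, -1, 1, 1, 1]
print(sign_changes)   # [(-1, 1)]
```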


132 DIFFERENTIATION: THEORY AND APPLICATIONS [58 


We defined above a function increasing or decreasing in an interval.
A function is sometimes said to be increasing or decreasing at a point
$x = x_0$. The exact meaning is: f(x) is increasing at $x = x_0$ if
$f(x) < f(x_0)$ for $x < x_0$ and $f(x) > f(x_0)$ for $x > x_0$, x being taken
sufficiently close to $x_0$. Similarly for a function decreasing at a point.
The concept of derivative leads directly to a sufficient condition
for a function to be increasing or decreasing at a point, viz., if $f'(x_0) > 0$,
f(x) is increasing at $x_0$, and if $f'(x_0) < 0$, f(x) is decreasing at $x_0$.
If, e.g., $f'(x_0) > 0$, the ratio:

$$\frac{f(x_0 + h) - f(x_0)}{h},$$

with limit $f'(x_0)$, will also be positive for all h with sufficiently small
absolute values, i.e. numerator and denominator will have the same
sign. In other words, we shall have $f(x_0 + h) - f(x_0) > 0$ for h > 0,
and $f(x_0 + h) - f(x_0) < 0$ for h < 0, so that f(x) is increasing at $x_0$.


58. Maxima and minima of functions. We again turn to the graph
of some function f(x) (Fig. 56). We have a successive alternation of
intervals of increase and decrease in this case. Segment $AM_1$ corresponds
to an interval of increase, the next segment $M_1M_2$ to one of
decrease, the next, $M_2M_3$, again to one of increase, and so on. Peaks
of the curve separate intervals of increase from those of decrease.
Take the peak $M_1$, for example; its ordinate is greater than all the
ordinates of the curve that are sufficiently close to the peak,
whether to the left or right. Such a peak is said to correspond to
a maximum of f(x).

This leads us to the following general analytic definition: the function
f(x) attains a maximum at the point $x = x_1$ if its value $f(x_1)$ at
this point is greater than its values at all neighbouring points, i.e. if
the increment of the function

$$f(x_1 + h) - f(x_1) < 0$$

for all positive or negative h, sufficiently small in absolute value.

We now consider the peak $M_2$. The ordinate in this case is less than
all neighbouring ordinates, whether on the left or right; the peak
is said to correspond to a minimum of the function, which is defined
analytically as follows: the function f(x) attains a minimum at the
point $x = x_2$, if we have

$$f(x_2 + h) - f(x_2) > 0$$

for all positive or negative h, sufficiently small in absolute value.

We see from the figure that the tangents at peaks corresponding
to either maxima or minima of the function f(x) lie parallel to the
axis OX, i.e. their slope f'(x) is zero. The tangent to a curve
can be parallel to OX, however, elsewhere than at a peak. In Fig. 57,
for example, point M of the curve is not a peak, yet the tangent
at M is parallel to OX.

Suppose that f'(x) becomes zero for some value $x = x_0$, i.e. the tangent
at the corresponding point of the graph is parallel to OX.
We consider the sign of f'(x) for x near $x_0$.

We take the following three cases:

I. For x less than, and sufficiently near, $x_0$, f'(x) is positive, whilst
f'(x) is negative for x greater than, and sufficiently near, $x_0$, i.e. in
other words, as x passes through $x_0$, f'(x) passes through zero from
positive to negative values.

In this case, we have an interval of increase to the left of $x = x_0$,
and an interval of decrease to the right, i.e. $x = x_0$ corresponds to
a peak of the curve, where f(x) is a maximum (Fig. 56).

II. f'(x) is negative for x less than $x_0$, and positive for x greater
than $x_0$, i.e. f'(x) passes through zero from negative to positive values.

In this case, we have an interval of decrease to the left of $x = x_0$,
and one of increase to the right, i.e. $x = x_0$ corresponds to a peak
of the curve that gives a minimum of the function (Fig. 56).

III. f'(x) has the same sign for x either less than or greater than $x_0$.
Suppose, for instance, that the sign is (+).

The corresponding point of the graph in this case lies in an interval
of increase, and is clearly not a peak (Fig. 57).

The above remarks lead us to the following rules for finding the
values of x for which f(x) has a maximum or minimum:

(1) find f'(x);
(2) find the x for which f'(x) is zero, i.e. solve the equation f'(x) = 0;
(3) find how the sign of f'(x) varies on passing through these values,
using the following arrangement:

  x     | $x_0 - h$ | $x_0$ | $x_0 + h$ | f(x)
  f'(x) |     +     |   0   |     -     | maximum
        |     -     |   0   |     +     | minimum
        |     +     |   0   |     +     | increasing
        |     -     |   0   |     -     | decreasing











In the above table $x_0 - h$ and $x_0 + h$ indicate that the sign of
f'(x) is to be determined for x less than, and greater than, $x_0$, and
always sufficiently close to $x_0$, so that h is taken as a sufficiently small
positive number.

It is assumed in this investigation that $f'(x_0) = 0$, whilst f'(x)
differs from zero for all x sufficiently near, but not coinciding with, $x_0$.

We return to the fact that the tangent at M in Fig. 57 (with abscissa
$x_0$) lies on different sides of the curve in the neighbourhood of
M. Here, $f'(x_0) = 0$ and f'(x) > 0 for all x near to, but not at, $x_0$,
all sections of the curve with $x_0$ as an interior point giving intervals
of increase, notwithstanding the fact that $f'(x_0) = 0$.

Sometimes a rather different definition from the above is given of
a maximum, viz., the function f(x) has a maximum at $x = x_1$ if $f(x_1)$ is
not less than the values of f(x) at neighbouring points, i.e. if we
have for the increment of the function $f(x_1 + h) - f(x_1) \le 0$ for
all h sufficiently small in absolute value, and either positive or negative.
A minimum at the point $x = x_2$ can be similarly defined by the
inequality $f(x_2 + h) - f(x_2) \ge 0$. If the function in this definition
has a derivative at its maximum or minimum, this derivative must
be zero, as above.


To take an example, let it be required to find the maxima and minima
of the function
$$f(x) = (x - 1)^2 (x - 2)^3.$$

We obtain the first derivative:
$$f'(x) = 2(x - 1)(x - 2)^3 + 3(x - 1)^2 (x - 2)^2 = (x - 1)(x - 2)^2 (5x - 7) = 5(x - 1)(x - 2)^2 \left(x - \frac{7}{5}\right).$$

It is clear from the last expression that f'(x) is zero for the following values
of the independent variable: $x_1 = 1$, $x_2 = 7/5$, $x_3 = 2$.

We now investigate these. For x = 1, the factor $(x - 2)^2$ has a plus
sign, and (x - 7/5) a minus sign. For all x sufficiently close to unity, and
either greater than or less than unity, each of these factors keeps the same
sign, so that their product is certainly minus for all x sufficiently near
unity. We finally consider the factor (x - 1), which is in fact zero at x = 1.
It is negative for x < 1, and positive for x > 1. Thus the complete derivative,
i.e. f'(x), has a plus sign for x < 1 and a minus sign for x > 1. Hence it follows
that x = 1 represents a maximum of the function f(x). We set x = 1 in
the expression for f(x) itself, thus obtaining the value of the maximum,
i.e. the ordinate of the corresponding peak of the graph of f(x):

$$f(1) = 0^2 \cdot (-1)^3 = 0.$$

By using the same sort of argument for the other values $x_2 = 7/5$ and
$x_3 = 2$, we obtain the following table:






































  x     | 1 - h |    1     | 1 + h | 7/5 - h |       7/5        | 7/5 + h | 2 - h |   2   | 2 + h
  f'(x) |   +   |    0     |   -   |    -    |        0         |    +    |   +   |   0   |   +
  f(x)  | incr. | 0 (max.) | decr. |  decr.  | -108/3125 (min.) |  incr.  | incr. | incr. | incr.
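
The arrangement used in this example translates directly into a small routine. The following Python sketch is only an illustration: the helper name `classify` and the step h = 0.001 are assumptions of ours, and the step must be small enough that no other zero of f'(x) lies within h of the point examined.

```python
def classify(df, x0, h=1e-3):
    """Sign test on the derivative df just to the left and right of x0."""
    left, right = df(x0 - h), df(x0 + h)
    if left > 0 and right < 0:
        return "maximum"
    if left < 0 and right > 0:
        return "minimum"
    if left > 0 and right > 0:
        return "increasing"
    if left < 0 and right < 0:
        return "decreasing"
    return "undetermined"   # the derivative vanishes at one of the test points

def df(x):
    # f'(x) for f(x) = (x - 1)^2 (x - 2)^3
    return 5 * (x - 1) * (x - 2) ** 2 * (x - 7 / 5)

for x0 in (1, 7 / 5, 2):
    print(x0, classify(df, x0))
```

For the three zeros of f'(x) the routine reports a maximum at x = 1, a minimum at x = 7/5, and an increasing function at x = 2, in agreement with the table.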











The method we have outlined for studying the maxima and minima
of a function may have the drawback, especially in more complicated
examples, that it is not too easy to find the sign of f'(x) for x both
greater than, and less than, the value in question. The trouble may
be avoided in many cases by taking the second derivative f''(x) into
account. Suppose we consider $x = x_0$, for which $f'(x_0) = 0$.
We shall assume that, on substituting $x = x_0$ in the expression for
the second derivative, we obtain a positive value, i.e. $f''(x_0) > 0$.
If f'(x) is taken as the basic function, the fact that its derivative f''(x)
is positive at $x = x_0$ means that the basic function itself, f'(x), is
increasing at this point, i.e. f'(x) passes from negative to positive
values at its zero-point $x_0$. Thus, for $f''(x_0) > 0$ at the point $x = x_0$,
f(x) will have a minimum. It can similarly be shown that, for $f''(x_0) < 0$
at $x = x_0$, f(x) has a maximum. Finally, if we get zero on substituting
$x = x_0$ in the expression for f''(x), i.e. $f''(x_0) = 0$, this prevents
us from using the second derivative to investigate $x = x_0$, and we
must turn back to direct consideration of the sign of f'(x). The arrangement
shown in the table is thus obtained:








  x     | f'(x) | f''(x) | f(x)
  $x_0$ |   0   |   -    | maximum
        |   0   |   +    | minimum
        |   0   |   0    | doubtful case





It follows directly from the above discussion that, given the existence
of the second derivative, a necessary condition for a maximum is
$f''(x) \le 0$, and a necessary condition for a minimum is $f''(x) \ge 0$.
We can determine the maximum here by the condition
$f(x_1 + h) - f(x_1) \le 0$, and the minimum by $f(x_2 + h) - f(x_2) \ge 0$, as described
above.


Example. It is required to find the maxima and minima of the function

$$f(x) = \sin x + \cos x.$$

This function has period $2\pi$, i.e. remains unchanged on substituting $x + 2\pi$
for x.

It is sufficient to consider x varying in the interval $(0, 2\pi)$.

We find the first and second derivatives:

$$f'(x) = \cos x - \sin x; \qquad f''(x) = -\sin x - \cos x.$$
We put the first derivative equal to zero, obtaining the equation:
$$\cos x - \sin x = 0 \quad \text{or} \quad \tan x = 1.$$

The roots of this equation in $(0, 2\pi)$ are:

$$x_1 = \frac{\pi}{4} \quad \text{and} \quad x_2 = \frac{5\pi}{4}.$$

We investigate these values by finding the sign of f''(x):

$$f''\left(\frac{\pi}{4}\right) = -\sin\frac{\pi}{4} - \cos\frac{\pi}{4} = -\sqrt{2} < 0; \quad \text{maximum } f\left(\frac{\pi}{4}\right) = \sqrt{2};$$
$$f''\left(\frac{5\pi}{4}\right) = -\sin\frac{5\pi}{4} - \cos\frac{5\pi}{4} = \sqrt{2} > 0; \quad \text{minimum } f\left(\frac{5\pi}{4}\right) = -\sqrt{2}.$$
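
The second-derivative rule for this example can be sketched in Python as follows. The function name `second_derivative_test` and the tolerance `eps`, below which f''(x) is treated as zero, are assumptions made for the illustration.

```python
import math

def second_derivative_test(d2f, x0, eps=1e-12):
    """Classify a point x0 at which f'(x0) = 0 by the sign of f''(x0)."""
    value = d2f(x0)
    if value < -eps:
        return "maximum"
    if value > eps:
        return "minimum"
    return "doubtful case"   # f''(x0) = 0: fall back on the sign of f'(x)

f = lambda x: math.sin(x) + math.cos(x)
d2f = lambda x: -math.sin(x) - math.cos(x)

for x0 in (math.pi / 4, 5 * math.pi / 4):
    print(x0, second_derivative_test(d2f, x0), f(x0))
```

For a function such as $x^4$, whose second derivative $12x^2$ vanishes at x = 0, the same routine returns the doubtful case, and the sign of the first derivative has to be examined instead.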


We note in conclusion a point that sometimes arises in finding maxima 
and minima. Points may occur on the graph of the function where there is 
either no tangent at all, or the tangent is parallel to OY (Fig. 58). The 
derivative f'(x) will not exist at points of the first kind, whilst it is
infinite at points of the second kind, since the slope of a line parallel to
OY is infinity. As is evident at once from the figure, however, these 
points can represent maxima or minima of the function. The above rule 
for finding maxima and minima should thus be amplified as follows, 
strictly speaking: maxima and minima of a function f(x) can occur not only 
at points where f'(x) is zero, but also at points where f'(x) either does not exist 
or becomes infinite. Investigation of these latter points must be carried out 
by the first of the arrangements given above, i.e. by finding the sign of 
f'(x) for x less than, and greater than, the value in question. 
Fig. 58

Example. It is required to find the maxima and minima of

$$f(x) = (x - 1) \sqrt[3]{x^2}.$$

We find the first derivative:

$$f'(x) = \sqrt[3]{x^2} + \frac{2(x - 1)}{3 \sqrt[3]{x}} = \frac{5x - 2}{3 \sqrt[3]{x}}.$$
This is zero for x = 2/5 and infinity for x = 0.
We consider the latter value: the numerator
of the above fraction is minus for x = 0, and also minus for all positive or
negative x that are near zero. The denominator of the fraction is negative for
x < 0, and positive for x > 0. The fraction as a whole is therefore positive for x
less than, and near, zero, and negative for x > 0, i.e. we have a maximum at
x = 0, f(0) = 0. We have a minimum at x = 2/5:

$$f\left(\frac{2}{5}\right) = -\frac{3}{5} \sqrt[3]{\frac{4}{25}}.$$
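
A numerical check of this example is sketched below in Python. Since raising a negative number to the power 1/3 does not give the real cube root in Python, a helper `cbrt` (our own, hypothetical name) is defined first; the step 0.001 is likewise an arbitrary choice.

```python
import math

def cbrt(x):
    """Real cube root, valid for negative x as well."""
    return math.copysign(abs(x) ** (1.0 / 3.0), x)

def f(x):
    return (x - 1) * cbrt(x) ** 2

def df(x):
    # f'(x) = (5x - 2) / (3 * cbrt(x)); not defined at x = 0
    return (5 * x - 2) / (3 * cbrt(x))

h = 1e-3
print(df(-h) > 0, df(h) < 0)              # sign change + to - at x = 0: maximum
print(df(0.4 - h) < 0, df(0.4 + h) > 0)   # sign change - to + at x = 2/5: minimum
print(f(0.4), -0.6 * cbrt(4 / 25))        # the two values agree
```

Note that f'(x) is never evaluated at x = 0 itself, where it does not exist; only its sign on either side is used, as the amplified rule requires.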


59. Curve tracing. Finding the maxima and minima of a function 
f(x) considerably simplifies the problem of tracing its curve. This 
is explained by means of a few examples. 


1. Let it be required to trace the graph of:
$$y = (x - 1)^2 (x - 2)^3,$$

which we considered in a previous paragraph. We then obtained two peaks
of the function: a maximum (1, 0) and a minimum (7/5, -108/3125). We fill
in these points in the figure. It is also useful to mark the intercepts of the
curve on the axes. We have for x = 0, y = -8, i.e. the point on axis OY
is y = -8.

We put y equal to zero, i.e.,

$$(x - 1)^2 (x - 2)^3 = 0,$$
and obtain the points on axis OX. One of these, x = 1, is a peak, as already
explained, whilst the second, x = 2, is not a peak, as was also explained
earlier, but corresponds to a point of the graph where the tangent is parallel
to OX. The required curve is shown in Fig. 59.


2. We trace the curve:
$$y = e^{-x^2}.$$

We find the first derivative:

$$y' = -2x e^{-x^2}.$$

Putting y' = 0, we get the value x = 0, corresponding
to a peak (maximum) of the curve, as is easily
seen, with ordinate y = 1. This also gives us the
point of the curve on OY. Putting y zero, we get
the equation $e^{-x^2} = 0$, which has no solution, i.e.
the curve has no points on OX. But we note that
as x tends to $+\infty$ or $-\infty$, the power in $e^{-x^2}$
tends to $-\infty$ and the expression as a whole tends
to zero, i.e. on infinite displacement to left or
right, the curve indefinitely approaches the axis OX.
The curve is shown in Fig. 60, in accordance with
the data obtained.


Fig. 59

3. We draw the curve
$$y = e^{-ax} \sin bx \quad (a > 0),$$
representing a damped oscillation. The factor sin bx does not exceed unity in
absolute value, and the graph as a whole is situated between the two curves:

$$y = e^{-ax} \quad \text{and} \quad y = -e^{-ax}.$$

As x tends to $+\infty$, the factor $e^{-ax}$, and hence the product $e^{-ax} \sin bx$ as a
whole, tends to zero, i.e. the curve approaches OX asymptotically on infinite
displacement to the right. The points of the curve on OX are given by the
equation

$$\sin bx = 0,$$
so that they are
$$x = \frac{k\pi}{b} \quad (k \text{ is an integer}).$$
Fig. 60

We find the first derivative:
$$y' = -a e^{-ax} \sin bx + b e^{-ax} \cos bx = e^{-ax} (b \cos bx - a \sin bx).$$
The expression in brackets can be put in the well-known form:

$$b \cos bx - a \sin bx = K \sin (bx + \varphi_0),$$

Fig. 61

where K and $\varphi_0$ are constants. On putting the first derivative equal to zero
we get the equation:
$$\sin (bx + \varphi_0) = 0,$$
which gives:
$$bx + \varphi_0 = k\pi, \quad \text{i.e.} \quad x = \frac{k\pi - \varphi_0}{b} \quad (k \text{ is an integer}). \tag{1}$$

When x passes through these values, $\sin (bx + \varphi_0)$ changes its sign each
time. The same can evidently be said of the derivative y', since

$$y' = K e^{-ax} \sin (bx + \varphi_0),$$
and the sign of the factor $e^{-ax}$ is constant. There are thus alternate maxima
and minima corresponding to these roots. We should have a sine wave without
the exponential factor:

$$y = \sin bx,$$

and the abscissae of its peaks would be given by:

$$\cos bx = 0,$$
i.e.
$$x = \frac{(2k + 1)\pi}{2b} \quad (k \text{ an integer}). \tag{$1_1$}$$
We thus see that the exponential factor not only reduces the amplitude
of the oscillation, but also displaces the abscissae of the peaks of the curve.
It is evident, on comparing equations (1) and ($1_1$), that this displacement is
equal to the constant $\left(-\frac{\pi}{2b} - \frac{\varphi_0}{b}\right)$. The graph of
the damped oscillation is shown in Fig. 61 for a = 1
and $b = 2\pi$. The peaks of the curve do not lie on the
dotted lines corresponding to $y = \pm e^{-ax}$. This is due to
the above mentioned displacement of the peaks.
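
The displacement of the peaks of the damped oscillation can be checked numerically. In the Python sketch below the phase is computed as `atan2(b, -a)`; this is our own choice, obtained by expanding K sin(bx + phase) and comparing coefficients (K sin(phase) = b, K cos(phase) = -a), and is not stated in the text. The function names are likewise hypothetical.

```python
import math

def phase(a, b):
    # phi_0 such that b*cos(bx) - a*sin(bx) = K*sin(bx + phi_0),
    # i.e. K*sin(phi_0) = b and K*cos(phi_0) = -a, with K = sqrt(a^2 + b^2).
    return math.atan2(b, -a)

def peak_abscissa(k, a, b):
    """Zeros of y' for y = exp(-a*x) * sin(b*x): x = (k*pi - phi_0) / b."""
    return (k * math.pi - phase(a, b)) / b

def sine_peak_abscissa(k, b):
    """Peaks of the undamped wave y = sin(b*x): x = (2k + 1)*pi / (2b)."""
    return (2 * k + 1) * math.pi / (2 * b)

def dy(x, a, b):
    return math.exp(-a * x) * (b * math.cos(b * x) - a * math.sin(b * x))

a, b = 1.0, 2 * math.pi          # the values used for Fig. 61
for k in range(1, 5):
    x = peak_abscissa(k, a, b)
    shift = x - sine_peak_abscissa(k, b)
    print(k, x, dy(x, a, b), shift)   # dy is close to 0; shift is the same for every k
```

The printed shift does not depend on k, which is the constant displacement referred to in the text.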


4. We draw the curve:

$$y = \frac{x^3 - 3x}{6}.$$
We find the first and second derivatives:
$$y' = \frac{x^2 - 1}{2}, \qquad y'' = x.$$
We put the first derivative equal to zero, and find
the two values $x_1 = 1$ and $x_2 = -1$. Substituting
these values in the second derivative, we see that
the first value represents a minimum, and the second a maximum.
We substitute these values in the expression for y, finding the corresponding
peaks of the curve:
$$\left(1, -\frac{1}{3}\right), \qquad \left(-1, \frac{1}{3}\right).$$

Putting x = 0, we get y = 0, i.e. the origin (0, 0) lies on the curve. Finally,
we get for zero y, apart from x = 0, two further points $x = \pm\sqrt{3}$, so that
the curve cuts the axes in three points, (0, 0), $(\sqrt{3}, 0)$ and $(-\sqrt{3}, 0)$. We note
further that, on simultaneously replacing x and y by -x and -y, both
sides of the equation of the curve only change sign, i.e. the origin is a centre
of symmetry of the curve (Fig. 62).


60. The greatest and least values of a function. We assume that the
values of the function f(x) are considered for values of the independent
variable x in the interval (a, b); it is required to find the greatest
and the least of these values. If f(x) is continuous, it will certainly
attain a greatest and least value, as we saw in [35], so that the corresponding
graph will have a greatest and a least ordinate in the interval
in question. We can find all the maxima and minima of the function
inside the interval (a, b) in accordance with the rule given above.
If the function f(x) has its greatest ordinate inside this interval, this
will evidently coincide with the greatest maximum of the function
inside the interval. It can happen, however, that the greatest ordinate
is not inside the interval, but at one of the ends x = a or x = b.
Hence it is not sufficient, in finding say the greatest value of a
function, to compare all its maxima inside the interval and take the
greatest, since its values at the ends of the interval must also be taken
into account. Similarly, the least value of a function must be found
by taking all its minima inside the interval, together with the boundary
values of the function for x = a and x = b. We remark here that
maxima and minima can be completely absent, yet a greatest and
least value must exist for a continuous function in a bounded interval
(a, b).

We note some particular cases, when the greatest and least values
can be found very simply. If, for example, f(x) is increasing in (a, b),
it will obviously take its least value at x = a, and its greatest value
at x = b. The opposite will be the case for f(x) decreasing.

If the function has a single maximum inside the interval, with 
no minima, this single maximum gives the greatest value of the func- 
tion (Fig. 63), so that it is quite unnecessary to find the values at the 
ends of the interval in order to find the greatest value of the function 
in this case. Similarly, if the function has a single minimum inside 
the interval, with no maxima, the minimum gives the least value 
of the function. The cases just mentioned are met with in the first 
of the four examples given below. 
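
The general rule of this section (compare the values at the interior maxima and minima with the values at the two ends) can be sketched in Python as follows. The function `extreme_values` is a hypothetical helper of ours; it assumes that the zeros of f'(x) have already been found and that f'(x) exists throughout the interval. The curve of example 4 of [59] serves as the test function.

```python
def extreme_values(f, a, b, critical_points):
    """Greatest and least values of f on the closed interval from a to b.

    Returns two (value, abscissa) pairs: the greatest and the least.
    """
    candidates = [a, b] + [c for c in critical_points if a < c < b]
    values = [(f(x), x) for x in candidates]
    return max(values), min(values)

f = lambda x: (x ** 3 - 3 * x) / 6      # curve with a maximum at x = -1 and a minimum at x = 1

print(extreme_values(f, -3, 3, [-1, 1]))       # the end values win: ((3.0, 3), (-3.0, -3))
print(extreme_values(f, -1.5, 1.5, [-1, 1]))   # the interior peaks win
```

On the wider interval the greatest and least values are attained at the ends, whereas on the narrower one they coincide with the interior maximum and minimum.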





1. It is required to cut a given line, of length l, into two parts, such that the
rectangle formed from these parts has a maximum area.

Let x be the length of one part of the line, so that the length of the other
part is (l - x). Since the area of a rectangle is the product of its neighbouring
sides, we see that the problem reduces to finding the value of x in the
interval (0, l) for which the function:

$$f(x) = x(l - x)$$
attains its greatest value.
We find the first and second derivatives:

$$f'(x) = (l - x) - x = l - 2x; \qquad f''(x) = -2 < 0.$$

Equating the first derivative to zero, we get a single value x = l/2, which
corresponds to a maximum, since f''(x) is a negative constant. The greatest
area is thus obtained with a square of side l/2.

2. A sector is cut out from a circle of radius R, and the remainder is glued
together into a cone. It is required to find for what angle of the sector cut out the
volume of the cone has its greatest value.

Instead of taking the angle of the sector cut out as the independent variable,
we take the difference between $2\pi$ and this angle as x, i.e. x is the angle of
the remaining sector. For x near 0 or $2\pi$, the volume of the cone approaches
zero, so that there is evidently a value of x in the interval $(0, 2\pi)$ for which
the volume is greatest.

The cone obtained by glueing together the remainder of the circle must
have a generator of length R, a length of circular base Rx, a radius of base
$r = Rx/2\pi$, and height:

$$h = \sqrt{R^2 - r^2} = \frac{R}{2\pi} \sqrt{4\pi^2 - x^2}.$$

The volume of this cone will be:

$$V = \frac{1}{3} \pi r^2 h = \frac{1}{3} \pi \cdot \frac{R^2 x^2}{4\pi^2} \cdot \frac{R}{2\pi} \sqrt{4\pi^2 - x^2} = \frac{R^3}{24\pi^2}\, x^2 \sqrt{4\pi^2 - x^2}.$$








The constant factor $R^3/24\pi^2$ can be neglected when finding the greatest
value of this function. The remaining product $x^2 \sqrt{4\pi^2 - x^2}$ is positive, and
therefore attains its greatest value for the same x as that for which its square
attains its greatest value. We can thus consider the function:

$$f(x) = 4\pi^2 x^4 - x^6$$
inside the interval $(0, 2\pi)$.
We find the first derivative:
$$f'(x) = 16\pi^2 x^3 - 6x^5.$$

It exists for all x. Equating it to zero, we obtain three values:

$$x_1 = 0, \qquad x_2 = -2\pi \sqrt{\frac{2}{3}}, \qquad x_3 = 2\pi \sqrt{\frac{2}{3}}.$$

The first two values do not lie inside the interval $(0, 2\pi)$. The single value
$x_3 = 2\pi \sqrt{2/3}$ remains, inside the interval; and since we saw above that the
x giving the greatest volume of cone must lie inside the interval, we can say
without investigating $x_3$ that this is the required x.
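
As a check on this result, the volume can be computed directly from the geometry and compared with its neighbouring values. The Python sketch below assumes R = 1 for convenience; the names `cone_volume` and `x_best` are ours.

```python
import math

def cone_volume(x, R=1.0):
    """Volume of the cone glued from a sector of angle x (0 < x < 2*pi)."""
    r = R * x / (2 * math.pi)          # radius of the base
    h = math.sqrt(R ** 2 - r ** 2)     # height, the generator being R
    return math.pi * r ** 2 * h / 3

x_best = 2 * math.pi * math.sqrt(2 / 3)

# A coarse search over the interval should land next to x_best.
grid = [i * 2 * math.pi / 10000 for i in range(1, 10000)]
x_grid = max(grid, key=cone_volume)

print(x_best, x_grid)
print(math.degrees(2 * math.pi - x_best))   # angle of the sector cut out, in degrees
```

The angle of the sector actually cut out is $2\pi - x_3$, which comes to roughly 66 degrees.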
3. A straight line L divides a plane into two parts (media) I and II. A point
moves in medium I with speed $v_1$, and in medium II with speed $v_2$. In what
path must a point move in order to pass as quickly as possible from point A
in medium I to point B in medium II?

Let $AA_1$ and $BB_1$ be the perpendiculars from A and B onto L. We introduce
the following notation:

$$AA_1 = a, \qquad BB_1 = b, \qquad A_1B_1 = c,$$

and the abscissae will be measured on L in the direction $A_1B_1$ (Fig. 65).
It is obvious that the point must have a straight-line path in both media,
I and II, though generally speaking, line AB will not represent the “quickest
path”. The “quickest path” will thus consist of two straight sections AM
and MB, with point M lying on L. We take the abscissa of M as independent
variable x ($x = A_1M$). We want the least value for time t, given by:

















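$$t = f(x) = \frac{AM}{v_1} + \frac{MB}{v_2} = \frac{\sqrt{a^2 + x^2}}{v_1} + \frac{\sqrt{b^2 + (c - x)^2}}{v_2}$$
in the interval $(-\infty, +\infty)$.
We find the first and second derivatives:

$$f'(x) = \frac{x}{v_1 \sqrt{a^2 + x^2}} - \frac{c - x}{v_2 \sqrt{b^2 + (c - x)^2}},$$

$$f''(x) = \frac{a^2}{v_1 (a^2 + x^2)^{3/2}} + \frac{b^2}{v_2 \left[b^2 + (c - x)^2\right]^{3/2}}.$$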


Both derivatives exist for all x, and f''(x) is always positive. Hence, f'(x)
is increasing in the interval $(-\infty, +\infty)$, and cannot become zero more
than once. But

$$f'(0) = -\frac{c}{v_2 \sqrt{b^2 + c^2}} < 0$$
and
$$f'(c) = \frac{c}{v_1 \sqrt{a^2 + c^2}} > 0,$$

and hence the equation

$$f'(x) = 0$$
has a single root $x_0$ between 0 and c, giving the single minimum of f(x),
since f''(x) > 0. The abscissae 0 and c represent points $A_1$ and $B_1$, so that
the required point M will lie between $A_1$ and $B_1$, as could also be shown
by elementary geometry.

We give the geometrical interpretation of the solution obtained. Let $\alpha$
and $\beta$ denote the angles formed by segments AM and BM respectively with
the perpendicular to L at M. The abscissa x of the required point M must
make f'(x) zero, i.e. must satisfy the equation:

$$\frac{x}{v_1 \sqrt{a^2 + x^2}} = \frac{c - x}{v_2 \sqrt{b^2 + (c - x)^2}},$$

which can be rewritten as:

$$\frac{A_1M}{v_1 \cdot AM} = \frac{MB_1}{v_2 \cdot BM},$$
or
$$\frac{\sin \alpha}{v_1} = \frac{\sin \beta}{v_2}, \quad \text{i.e.} \quad \frac{\sin \alpha}{\sin \beta} = \frac{v_1}{v_2};$$

the “quickest path” will be that for which the ratio of the sines of the
angles $\alpha$ and $\beta$ is equal to the ratio of the speeds in media I and II. This result
gives us the well-known law for the refraction of light, so that refraction
takes place with the ray of light choosing the “quickest path” from a
point in one medium to a point in another.
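
The sine law for the quickest path can be verified numerically. The Python sketch below locates the root of f'(x) between 0 and c by halving the interval, which is justified by the signs of f'(0) and f'(c) found above, and then compares the ratio of the sines with the ratio of the speeds. The function names and the sample values of a, b, c, $v_1$, $v_2$ are assumptions chosen only for illustration.

```python
import math

def d_travel_time(x, a, b, c, v1, v2):
    """f'(x) for the travel time f(x) = AM/v1 + MB/v2."""
    return x / (v1 * math.hypot(a, x)) - (c - x) / (v2 * math.hypot(b, c - x))

def quickest_crossing(a, b, c, v1, v2, halvings=100):
    """Abscissa of M on L that minimizes the travel time (a, b, c > 0)."""
    lo, hi = 0.0, c                     # f'(0) < 0 < f'(c)
    for _ in range(halvings):
        mid = 0.5 * (lo + hi)
        if d_travel_time(mid, a, b, c, v1, v2) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

a, b, c, v1, v2 = 1.0, 2.0, 3.0, 1.0, 0.5
x0 = quickest_crossing(a, b, c, v1, v2)
sin_alpha = x0 / math.hypot(a, x0)            # A1M / AM
sin_beta = (c - x0) / math.hypot(b, c - x0)   # MB1 / BM
print(sin_alpha / sin_beta, v1 / v2)          # the two ratios agree
```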

4. We suppose that the experimental determination of a magnitude x
involves making n individual careful observations, giving n values:

$$a_1, a_2, \ldots, a_n,$$

which are not identical due to instrument inaccuracies. The value of x
giving the least sum of squares of errors is taken as the “best” value. Finding
this best value thus means finding the x giving the least value of the
function

$$f(x) = (x - a_1)^2 + (x - a_2)^2 + \ldots + (x - a_n)^2$$
in the interval $(-\infty, +\infty)$.

We find the first and second derivatives:
$$f'(x) = 2(x - a_1) + 2(x - a_2) + \ldots + 2(x - a_n),$$
$$f''(x) = 2 + 2 + \ldots + 2 = 2n > 0.$$

Making the first derivative zero, we get a single value:

$$x = \frac{a_1 + a_2 + \ldots + a_n}{n},$$

which represents a minimum, since the second derivative is positive. Thus,
the “best” value of x is the arithmetic mean of the observational results.
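
A brief Python illustration of the arithmetic-mean result follows; the observation values are invented purely for the example, and the function names are ours.

```python
def sum_of_squares(x, observations):
    """f(x) = (x - a_1)^2 + ... + (x - a_n)^2."""
    return sum((x - a) ** 2 for a in observations)

def best_value(observations):
    """The zero of f'(x): the arithmetic mean of the observations."""
    return sum(observations) / len(observations)

observations = [9.8, 10.1, 10.0, 9.9, 10.2]   # hypothetical readings
m = best_value(observations)

# Any displacement from the mean increases the sum of squares.
print(m, sum_of_squares(m, observations))
print(sum_of_squares(m - 0.01, observations), sum_of_squares(m + 0.01, observations))
```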
5. To find the shortest distance from a point M to a circle.
We take the centre of the circle as origin O, and the line OM as axis OX.
Let OM = a, and let R be the radius of the circle. The equation of the circle
is:
$$x^2 + y^2 = R^2,$$

and the distance of M, with coordinates (a, 0), to any point of the circle will
be:
$$\sqrt{(x - a)^2 + y^2}.$$
We shall find the least value of the square of this distance. On substituting
for $y^2$ from the equation of the circle, we get the function:

$$f(x) = (x - a)^2 + (R^2 - x^2) = -2ax + a^2 + R^2,$$

where the independent variable x lies in the interval $(-R \le x \le R)$.

Fig. 66        Fig. 67

Since the first derivative f'(x) = -2a is negative for all x, function f(x)
is decreasing, and hence attains its least value for x = R at the right-hand
end of the interval. The shortest distance is thus given by PM (Fig. 66).

6. Given a right circular cone, it is required to inscribe in it a cylinder with
the greatest total surface area.

Let the radius of base of the cone be R and its height H, and let the radius
of base and height of the cylinder be r and h. The function whose greatest
value is required is here:
$$S = 2\pi r^2 + 2\pi r h.$$

Variables r and h are connected, due to the fact of inscribing the cylinder
in the given cone. We have from the similar triangles ABD and AMN
(Fig. 67):

$$\frac{MN}{AN} = \frac{BD}{AD},$$
or
$$\frac{h}{R - r} = \frac{H}{R},$$
whence
$$h = \left(1 - \frac{r}{R}\right) H.$$
We obtain on substituting for h in the expression for S:

$$S = 2\pi \left[r^2 + rH \left(1 - \frac{r}{R}\right)\right].$$


Hence, S is a function of the single independent variable r, which must
lie in the interval $0 \le r \le R$. We find the first and second derivatives:

$$\frac{dS}{dr} = 2\pi \left(2r + H - \frac{2rH}{R}\right), \qquad \frac{d^2S}{dr^2} = 4\pi \left(1 - \frac{H}{R}\right).$$
We find a single value of r, on making dS/dr zero:
$$r = \frac{HR}{2(H - R)}. \tag{2}$$
For this value to lie inside (0, R), the following inequalities must be satisfied:
$$0 < \frac{HR}{2(H - R)} \quad \text{and} \quad \frac{HR}{2(H - R)} < R. \tag{3}$$

The first inequality is equivalent to the condition that H > R. We multiply
both sides of the second inequality by the positive term 2(H - R) and get:

$$R < \frac{1}{2} H.$$


With fulfilment of this condition, $d^2S/dr^2$ is negative; the value (2) gives
the single maximum of function S, and the greatest surface-area of the cylinder.
The size of the latter can easily be found by substituting for r from (2)
in the expression for S.

We now take the case of value (2) not lying inside (0, R), i.e. one of the
inequalities (3) is not satisfied. Two possibilities can now arise: either $H \le R$,
or H > R but $R \ge H/2$. Both these can be characterized by the single
inequality:

$$H \le 2R. \tag{4}$$
We rewrite dS/dr as:
$$\frac{dS}{dr} = 2\pi \left(2r + H - \frac{2rH}{R}\right) = \frac{2\pi}{R} \left[(2R - H) r + H(R - r)\right].$$

It is clear from the new expression that dS/dr > 0 for 0 < r < R on
satisfying (4), i.e. S is increasing in (0, R), and thus attains its greatest value
for r = R. Evidently, h = 0 with this value of r, and we can look on the
result obtained as a flattened cylinder, the base of which coincides with the
base of the cone, and the total surface-area of which is $2\pi R^2$.
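
The two cases of this example (an interior maximum when H > 2R, and the greatest value at the end r = R otherwise) can be collected into one short routine. The Python sketch below is illustrative only; the function names and the sample dimensions are our own.

```python
import math

def surface(r, R, H):
    """Total surface area of the inscribed cylinder of radius r."""
    h = H * (1 - r / R)                    # from the similar triangles
    return 2 * math.pi * r ** 2 + 2 * math.pi * r * h

def best_cylinder_radius(R, H):
    """Radius giving the greatest total surface area for 0 <= r <= R."""
    if H > 2 * R:
        return H * R / (2 * (H - R))       # value (2), inside (0, R)
    return R                               # case (4): S is increasing, take the end r = R

# Tall cone: interior maximum at r = 2/3.
r1 = best_cylinder_radius(1.0, 4.0)
print(r1, surface(r1, 1.0, 4.0))

# Squat cone: flattened cylinder, r = R and S = 2*pi*R^2.
r2 = best_cylinder_radius(1.0, 1.5)
print(r2, surface(r2, 1.0, 1.5), 2 * math.pi)
```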


61. Fermat’s theorem. We have used elementary geometry above,
to give methods for studying the increase and decrease of a function,
as well as for finding its maxima and minima and its greatest and
least values. We now turn to the rigorous analytic statement of
some theorems and formulae, which give us an analytic proof of
the rules found above, and also enable the study of functions
to be carried somewhat further. When stating the next theorems
and formulae, we shall include a detailed and precise account of all
the conditions for which they are valid.

FERMAT'S THEOREM. If f(x) is continuous in (a, b), has a derivative
at every interior point of the interval, and attains its greatest (or least)
value at some interior point x = c, the first derivative must be zero at
x = c, i.e. f'(c) = 0.

We suppose for clarity that f(c) is the greatest value of the function.
The proof will be exactly similar, in the case of its being the least
value. In accordance with the condition that x = c lies inside the
interval, the difference f(c + h) - f(c) will be negative, or at any
rate not positive, for any positive or negative h:

$$f(c + h) - f(c) \le 0.$$
We take the ratio:
$$\frac{f(c + h) - f(c)}{h}.$$

The numerator of this fraction is less than or equal to zero, as
remarked, so that:

$$\frac{f(c + h) - f(c)}{h} \le 0 \quad \text{for } h > 0,$$
$$\frac{f(c + h) - f(c)}{h} \ge 0 \quad \text{for } h < 0. \tag{5}$$

Point x = c lies inside the interval, and a derivative exists at
the point by hypothesis, i.e. the above fraction tends to a definite
limit f'(c), when h tends to zero in any manner. We first suppose
that h tends to zero through positive values. Passing to the limit
in the first of inequalities (5), we get:

$$f'(c) \le 0.$$

Similarly, passing to the limit with $h \to 0$ through negative values in the
second inequality of (5) gives:

$$f'(c) \ge 0.$$

Taking these two inequalities together, we obtain the required
result:

$$f'(c) = 0.$$

62. Rolle’s theorem. If f(x) is continuous in (a, b), has a derivative
at every interior point of the interval, and has equal values at the ends
of the interval, i.e. f(a) = f(b), then there exists at least one interior
point, x = c, such that the derivative is zero, i.e. f'(c) = 0.

A continuous function f(x) must have a least value m and a greatest
value M in the interval considered. If the least and greatest values
happened to coincide, i.e. m = M, it would follow that the function
kept a constant value m (or M) throughout the interval. We know
that the derivative of a constant is zero, so that in this simple case, the
derivative would be zero at every interior point of the interval. Turning to
the general case, we can thus suppose that m < M. Since the function has
the same value at the limits, i.e. f(a) = f(b), by hypothesis, at least one of
the numbers m or M must differ from this common value. Suppose, for
example, that it is M, i.e. the greatest value of the function is reached
inside, and not at the limits, of the interval. Let x = c be the point at
which this value is reached. By Fermat’s theorem, we shall have f'(c) = 0
at this point, which proves Rolle’s theorem.


Fig. 69        Fig. 70


In the particular case of f(a) = f(b) = 0, Rolle’s theorem can be 
briefly formulated as: the first derivative of f(x) will vanish at at 
least one point in (a, b). 

There is a simple geometrical interpretation of Rolle’s theorem.
By hypothesis, f(a) = f(b), i.e. the ordinates of the curve y = f(x)
are equal at the ends of the interval, and a derivative exists inside
the interval, i.e. the curve has a definite tangent. Rolle’s theorem
says that there exists at least one interior point in this case, where
the derivative is zero, i.e. where the tangent is parallel to axis OX
(Fig. 68).

Remark. If the condition regarding the existence of the derivative
f'(x) is not fulfilled at every interior point of the interval, the
conclusion of Rolle's theorem may fail. Take, for example,

$$ f(x) = 1 - \sqrt[3]{x^2}; $$

this is continuous in the interval (−1, 1), and f(−1) = f(1) = 0,
but the derivative,

$$ f'(x) = -\frac{2}{3\sqrt[3]{x}}, $$

does not vanish inside the interval. This is possible because
f'(x) does not exist (tends to infinity) for x = 0 (Fig. 69). Another
example is the curve shown in Fig. 70. Here we have the curve y = f(x),
with f(a) = f(b) = 0. But it is evident from the figure that the tangent
cannot be parallel to OX inside the interval (a, b), i.e. f'(x) does not
vanish. This happens because the curve has two different tangents
at x = α, to the left and right of this point, and hence a unique
derivative does not exist at this point, so that the condition of Rolle's
theorem regarding the existence of the derivative at every interior
point is not fulfilled.


63. Lagrange's formula. We assume that f(x) is continuous in (a, b),
and has a derivative inside this interval, but that the condition
f(a) = f(b) of Rolle's theorem need not be satisfied. We form a new
function:

$$ F(x) = f(x) + \lambda x, $$

where λ is a constant, so defined that the new function satisfies the
condition mentioned for Rolle's theorem, i.e. we make:

$$ F(a) = F(b), $$

so that

$$ f(a) + \lambda a = f(b) + \lambda b, $$

whence

$$ \lambda = -\frac{f(b) - f(a)}{b - a}. $$

Applying Rolle's theorem now to F(x), we can say that there will
be a point x = c between a and b, where

$$ F'(c) = f'(c) + \lambda = 0 \qquad (a < c < b), $$

whence, on substituting the above expression for λ:

$$ f'(c) - \frac{f(b) - f(a)}{b - a} = 0. $$

The last equation can be written:

$$ f(b) - f(a) = (b - a) f'(c). $$


This equation is called Lagrange's formula. The value c lies between
a and b, so that the ratio (c − a)/(b − a) = θ lies between zero and
unity, and we can write:

$$ c = a + \theta (b - a) \qquad (0 < \theta < 1), $$

and Lagrange's formula now takes the form:

$$ f(b) - f(a) = (b - a) f'[a + \theta (b - a)] \qquad (0 < \theta < 1). $$

Putting b = a + h, the formula becomes:

$$ f(a + h) - f(a) = h f'(a + \theta h). $$

Lagrange's formula gives an exact expression for the increment
f(b) − f(a) of the function f(x), so that it is also called the formula of
finite increments.
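
As an illustration, the point c of Lagrange's formula can be located numerically. The sketch below is only a minimal example under stated assumptions: the function f(x) = x³ on the interval (0, 2) and the helper name `mean_value_point` are chosen here for illustration, and bisection is used on the assumption that f'(x) minus the slope of the chord changes sign between a and b.

```python
def mean_value_point(f, df, a, b, tol=1e-12):
    """Locate c in (a, b) with df(c) == (f(b) - f(a)) / (b - a) by bisection.

    Assumes df(x) - slope has opposite signs at a and b (true for the demo below).
    """
    slope = (f(b) - f(a)) / (b - a)

    def g(x):
        return df(x) - slope

    lo, hi = a, b
    if g(lo) * g(hi) > 0:
        raise ValueError("no sign change of f'(x) - slope on [a, b]")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(lo) * g(mid) <= 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


# Illustrative choice: f(x) = x**3 on (0, 2), so the chord has slope 4.
a, b = 0.0, 2.0
c = mean_value_point(lambda x: x ** 3, lambda x: 3 * x ** 2, a, b)
theta = (c - a) / (b - a)
print(c, theta)  # c is close to 2 / sqrt(3); theta lies between 0 and 1
```

Here 3c² = 4, so c = 2/√3, and the corresponding θ = c/2 indeed lies between zero and unity.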

We know that the derivative of a constant is zero. Lagrange’s 
formula allows us to state the converse: if the derivative f'(x) is zero 
for every point of (a, b), f(x) is constant in the interval. 

In fact, taking an arbitrary x of (a, b), and applying Lagrange's
formula to the interval (a, x), we get:

$$ f(x) - f(a) = (x - a) f'(\xi) \qquad (a < \xi < x); $$

but f'(ξ) = 0 by hypothesis, hence:

$$ f(x) - f(a) = 0, \quad \text{i.e.} \quad f(x) = f(a) = \text{constant}. $$

All we know as regards the magnitude c that appears in Lagrange's
formula is that it lies between a and b, so that the formula does not
enable us to calculate the increment of a function exactly from
its derivative; the formula can be used, however, to estimate the
error involved in replacing the increment of a function by its
differential.


Example. Let

$$ f(x) = \log_{10} x. $$

The derivative is

$$ f'(x) = \frac{1}{x} \cdot \frac{1}{\log 10} = \frac{M}{x} \qquad (M = 0.43429\ldots), $$

and Lagrange's formula gives us:

$$ \log_{10}(a + h) - \log_{10} a = h \frac{M}{a + \theta h} $$

or

$$ \log_{10}(a + h) = \log_{10} a + h \frac{M}{a + \theta h}. $$

Replacing the increment by the differential gives us the approximate
formula:

$$ \log_{10}(a + h) - \log_{10} a = h \frac{M}{a}, \qquad \log_{10}(a + h) = \log_{10} a + h \frac{M}{a}. $$

On comparing this approximate equation with the exact one, obtained
with Lagrange's formula, we see that the error is:

$$ h \frac{M}{a} - h \frac{M}{a + \theta h} = \frac{\theta h^2 M}{a (a + \theta h)} \qquad (0 < \theta < 1). $$

Putting a = 100 and h = 1, we get the approximate equation:

$$ \log_{10} 101 = \log_{10} 100 + \frac{M}{100} = 2.00434\ldots $$

with the error

$$ \frac{\theta M}{100 (100 + \theta)} \qquad (0 < \theta < 1). $$

Replacing θ by unity in the numerator, and by zero in the denominator,
of this fraction, we can say on evaluating the fraction that the error in
calculating the value of log₁₀ 101 is less than

$$ \frac{M}{100^2} = 0.00004\ldots $$
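
The estimate can be checked numerically. The short sketch below recomputes the approximate value, the actual error and the bound M/100²; the variable names are chosen here for illustration and the standard `math` module is assumed.

```python
import math

M = 1 / math.log(10)          # modulus of common logarithms, 0.43429...
a, h = 100.0, 1.0

approx = math.log10(a) + h * M / a    # increment replaced by the differential
exact = math.log10(a + h)             # value computed by the library
error = approx - exact                # positive, since theta > 0
bound = h * h * M / (a * a)           # theta -> 1 in numerator, 0 in denominator

print(approx, exact, error, bound)
```

The computed error is positive and smaller than the bound, in agreement with the estimate above.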


We can rewrite Lagrange's formula as:

$$ \frac{f(b) - f(a)}{b - a} = f'(c) \qquad (a < c < b). $$

We see from the graph of y = f(x) (Fig. 71) that the ratio

$$ \frac{f(b) - f(a)}{b - a} = \frac{CB}{AC} = \tan \angle CAB $$

gives the slope of the chord AB, while f'(c) gives the slope of the tangent
at some point M of the arc AB of the curve. Lagrange's
formula thus amounts to the assertion: there is a point on an arc
of a curve where the tangent is parallel to the chord. Rolle's theorem
is a special case of this assertion, when the chord is parallel to OX,
i.e. f(a) = f(b).

Remark. The tests for increase and decrease, which we established
above from the graph, follow immediately from Lagrange's formula.
We suppose that the first derivative is positive inside a certain
interval, and we let x and x + h be two points of this interval.
Evidently, from Lagrange's formula:

$$ f(x + h) - f(x) = h f'(x + \theta h) \qquad (0 < \theta < 1), $$

the difference on the left will be positive for positive h, since both
factors of the product on the right are positive in this case. The
assumption that the derivative is positive in some interval thus
leads us to:

$$ f(x + h) - f(x) > 0, $$

i.e. the function is increasing in this interval. The test for a decreasing
function similarly follows directly from the formula written above.

We also note here that the argument used in proving Fermat's 
theorem is fully applicable to the case when the function reaches 
a maximum or minimum, and not necessarily its greatest or least 
value, at the point in question. This argument shows us that the 
first derivative must be zero at such points, if it exists. 


64. Cauchy's formula. We take f(x) and φ(x) continuous in (a, b),
with derivatives at every interior point of the interval, and with
φ'(x) not zero at any interior point. We apply Lagrange's formula
to φ(x), giving

$$ \varphi(b) - \varphi(a) = (b - a) \varphi'(c_1) \qquad (a < c_1 < b), $$

with φ'(c₁) ≠ 0 by hypothesis, whence

$$ \varphi(b) - \varphi(a) \ne 0. $$

We form the function:

$$ F(x) = f(x) + \lambda \varphi(x), $$

where λ is a constant, so defined that F(a) = F(b), i.e.

$$ f(a) + \lambda \varphi(a) = f(b) + \lambda \varphi(b), $$

whence

$$ \lambda = -\frac{f(b) - f(a)}{\varphi(b) - \varphi(a)}. $$

Rolle's theorem is applicable to F(x) with this choice of λ, giving
us the existence of x = c such that

$$ F'(c) = f'(c) + \lambda \varphi'(c) = 0 \qquad (a < c < b). $$

This equation gives:

$$ \frac{f'(c)}{\varphi'(c)} = -\lambda \qquad (\varphi'(c) \ne 0), $$

whence, on substituting the value found for λ, we get:

$$ \frac{f(b) - f(a)}{\varphi(b) - \varphi(a)} = \frac{f'(c)}{\varphi'(c)} $$

or

$$ \frac{f(b) - f(a)}{\varphi(b) - \varphi(a)} = \frac{f'[a + \theta(b - a)]}{\varphi'[a + \theta(b - a)]} \qquad (0 < \theta < 1), \qquad (6) $$

or

$$ \frac{f(a + h) - f(a)}{\varphi(a + h) - \varphi(a)} = \frac{f'(a + \theta h)}{\varphi'(a + \theta h)}. $$

This is Cauchy's formula. On putting φ(x) = x in this formula,
we have φ'(x) = 1, and the formula becomes:

$$ \frac{f(b) - f(a)}{b - a} = \frac{f'(c)}{1} $$

or

$$ f(b) - f(a) = (b - a) f'(c), $$

i.e. we get Lagrange's formula as a special case of Cauchy's formula.


65. Evaluating indeterminate forms. If two functions φ(x) and
ψ(x) vanish at x = a, the fraction φ(x)/ψ(x) is an indeterminate form
of the type 0/0 at x = a. We indicate a method of evaluating such
indeterminate forms. We suppose that φ(x) and ψ(x) are continuous
and have a first derivative near x = a, whilst ψ'(x) does not vanish
for x near, but not equal to, a.

We prove the following theorem: if, with the above assumptions,
φ'(x)/ψ'(x) tends to a limit b as x tends to a, then φ(x)/ψ(x) tends to the
same limit.

Noting that:

$$ \varphi(a) = \psi(a) = 0, $$

and using Cauchy's formula [64], we get:

$$ \frac{\varphi(x)}{\psi(x)} = \frac{\varphi(x) - \varphi(a)}{\psi(x) - \psi(a)} = \frac{\varphi'(\xi)}{\psi'(\xi)} \qquad (\xi \text{ between } a \text{ and } x). \qquad (7) $$

We remark that we are justified in applying Cauchy's formula,
given the assumptions made regarding φ(x) and ψ(x).

If x tends to a, then ξ, lying between x and a, will tend to the same
limit. The right-hand side of (7) now tends to b, by hypothesis, and
hence φ(x)/ψ(x) on the left-hand side of (7) tends to the same limit.

The theorem just proved gives us the following rule for evaluating an
indeterminacy of the form 0/0:

To find the limit of φ(x)/ψ(x), in the case of an indeterminacy of the form
0/0, the ratio of the functions can be replaced by the ratio of the derivatives,
and the limit found of this new ratio.

This rule was given by the French mathematician l'Hôpital, and is
usually named after him.

If the ratio of the derivatives also leads to an indeterminacy of
the form 0/0, the rule can be applied to this ratio, and so on.
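
The rule can be illustrated numerically. The sketch below, a rough check rather than a proof, takes the pair φ(x) = x − sin x, ψ(x) = x³ and tabulates both the ratio of the functions and the ratio of their first derivatives for decreasing x; the function names are chosen here for illustration.

```python
import math


def ratio_of_functions(x):
    # phi(x) / psi(x) with phi(x) = x - sin x, psi(x) = x**3
    return (x - math.sin(x)) / x ** 3


def ratio_of_derivatives(x):
    # phi'(x) / psi'(x) = (1 - cos x) / (3 x**2)
    return (1 - math.cos(x)) / (3 * x ** 2)


for x in (0.5, 0.1, 0.01):
    print(x, ratio_of_functions(x), ratio_of_derivatives(x))
```

Both columns approach the same value 1/6 as x decreases, as the theorem asserts.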








[Fig. 72]


We give some examples of using the rule:

1.
$$ \lim_{x \to 0} \frac{(1 + x)^n - 1}{x} = \lim_{x \to 0} \frac{n (1 + x)^{n-1}}{1} = n. $$

2.
$$ \lim_{x \to 0} \frac{x - \sin x}{x^3} = \lim_{x \to 0} \frac{1 - \cos x}{3x^2} = \lim_{x \to 0} \frac{\sin x}{6x} = \lim_{x \to 0} \frac{\cos x}{6} = \frac{1}{6}, $$

i.e. the difference x − sin x is an infinitesimal of the third order with respect
to x.

3.
$$ \lim_{x \to 0} \frac{x - x\cos x}{x - \sin x} = \lim_{x \to 0} \frac{1 - \cos x + x \sin x}{1 - \cos x} = \lim_{x \to 0} \frac{2\sin x + x\cos x}{\sin x} = \lim_{x \to 0} \frac{2\cos x + \cos x - x \sin x}{\cos x} = 3. $$

The result of this example leads to a practical method for rectifying the
arc of a circle.

We take a circle of radius unity. A diameter of the circle is taken as OX,
and the tangent at one end of the diameter as OY. (See Fig. 72.)

We take a certain arc OM, and let ON be a segment along OY equal to
arc OM. We draw NM and let P be its point of intersection with OX.

Let u denote the length of arc OM (radius taken as unity). The equation
of NM can be written in terms of its intercepts on the axes:

$$ \frac{x}{OP} + \frac{y}{u} = 1. $$

We find the length of OP by noting that M lies on NM and has the
coordinates:

$$ x = OQ = 1 - \cos u, \qquad y = QM = \sin u. $$

These coordinates must satisfy the following equation:

$$ \frac{1 - \cos u}{OP} + \frac{\sin u}{u} = 1, $$

whence

$$ OP = \frac{u - u \cos u}{u - \sin u}. $$

The result of Example 3 shows that OP → 3 as u → 0, i.e. P tends to
D on OX, the distance of D from the origin being three times the radius.
This gives a simple method of approximate rectification of the arc of a circle.
To rectify arc OM, the segment OD must be laid off from O, three times the radius
of the circle, then the line DM must be drawn. The intercept of DM on OY,
ON, gives the approximate length of arc OM. This method gives a very
satisfactory result, especially for small arcs; even for an arc of π/2, the relative
error is about 5%.
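
The quality of this construction can be checked with a few lines of code. The sketch below assumes the set-up described above (unit circle, D at distance 3 from O along OX, M with coordinates (1 − cos u, sin u)); the function name `rectified_length` is chosen here for illustration.

```python
import math


def rectified_length(u):
    """Intercept on OY of the line through D(3, 0) and M(1 - cos u, sin u)."""
    x_m, y_m = 1 - math.cos(u), math.sin(u)
    # Line through (3, 0) and (x_m, y_m), evaluated at x = 0.
    return 3 * y_m / (3 - x_m)


for u in (0.1, 0.5, 1.0, math.pi / 2):
    approx = rectified_length(u)
    print(u, approx, (u - approx) / u)   # arc, construction, relative error
```

For u = π/2 the construction gives 1.5 against the true length 1.5708..., a relative error of about 4.5%, which agrees with the figure of about 5% quoted above; for small arcs the error is far smaller.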


66. Other indeterminate forms. The theorem of [65] can be justified
for an indeterminacy of the form ∞/∞. Let

$$ \lim_{x \to a} \varphi(x) = \lim_{x \to a} \psi(x) = \infty \qquad (8) $$

and

$$ \lim_{x \to a} \frac{\varphi'(x)}{\psi'(x)} = b. \qquad (9) $$

We show that φ(x)/ψ(x) tends to the same limit b, assuming that ψ'(x)
does not vanish for x near a.

We consider two values x and x₀ of the independent variable,
near a, and such that x lies between x₀ and a. We have by Cauchy's
formula:

$$ \frac{\varphi(x) - \varphi(x_0)}{\psi(x) - \psi(x_0)} = \frac{\varphi'(\xi)}{\psi'(\xi)} \qquad (\xi \text{ between } x \text{ and } x_0), $$

whilst, on the other hand,

$$ \frac{\varphi(x) - \varphi(x_0)}{\psi(x) - \psi(x_0)} = \frac{\varphi(x)}{\psi(x)} \cdot \frac{1 - \dfrac{\varphi(x_0)}{\varphi(x)}}{1 - \dfrac{\psi(x_0)}{\psi(x)}}. $$

We remark that it follows directly from (8) that φ(x) and ψ(x) differ
from zero for x near a.

Comparison of these two equations gives us, after re-arrangement:

$$ \frac{\varphi(x)}{\psi(x)} = \frac{\varphi'(\xi)}{\psi'(\xi)} \cdot \frac{1 - \dfrac{\psi(x_0)}{\psi(x)}}{1 - \dfrac{\varphi(x_0)}{\varphi(x)}}, \qquad (10) $$

where ξ lies between x and x₀, and hence between a and x₀. We take
x₀ sufficiently near a, so that, by (9), the first factor on the right of
(10) differs from b by an arbitrarily small amount, for any choice
of x between x₀ and a. Having thus fixed x₀, we let x approach a.
The second factor on the right of (10) now tends to unity, by (8),
and hence we can say that the left-hand side of (10) differs from b by an
arbitrarily small amount for values of x near a, i.e.

$$ \lim_{x \to a} \frac{\varphi(x)}{\psi(x)} = b. $$

It follows from the theorem just proved that l'Hôpital's rule can
be used for evaluating an indeterminacy of the form ∞/∞.

We note some further indeterminate forms. We take the product
φ(x)ψ(x), and let

$$ \lim_{x \to a} \varphi(x) = 0 \quad \text{and} \quad \lim_{x \to a} \psi(x) = \infty. $$

This gives the indeterminate form 0·∞. It is easily transformed
to the form 0/0 or ∞/∞:

$$ \varphi(x)\psi(x) = \frac{\varphi(x)}{\dfrac{1}{\psi(x)}} = \frac{\psi(x)}{\dfrac{1}{\varphi(x)}}. $$

We consider finally the expression φ(x)^ψ(x) and let

$$ \lim_{x \to a} \varphi(x) = 1 \quad \text{and} \quad \lim_{x \to a} \psi(x) = \infty. $$

This gives the indeterminate form 1^∞. We take the logarithm of
the above expression:

$$ \log \left[ \varphi(x)^{\psi(x)} \right] = \psi(x) \log \varphi(x), $$

giving the indeterminate form ∞·0. By evaluating this indeterminacy,
i.e. finding the limit of the logarithm of the given expression, we
can discover the limit of the expression itself. The indeterminate
forms ∞⁰ and 0⁰ are similarly evaluated.

We now take some examples:

1.
$$ \lim_{x \to +\infty} \frac{e^x}{x} = \lim_{x \to +\infty} \frac{e^x}{1} = +\infty, $$

$$ \lim_{x \to +\infty} \frac{e^x}{x^2} = \lim_{x \to +\infty} \frac{e^x}{2x} = \lim_{x \to +\infty} \frac{e^x}{2} = +\infty. $$

It can similarly be shown that eˣ/xⁿ tends to infinity as x tends to +∞,
for any positive n, i.e. the exponential function eˣ increases more rapidly
than any positive power of x, on indefinite increase of x.

2.
$$ \lim_{x \to +\infty} \frac{\log x}{x^n} = \lim_{x \to +\infty} \frac{\dfrac{1}{x}}{n x^{n-1}} = \lim_{x \to +\infty} \frac{1}{n x^n} = 0 \qquad (n > 0), $$

i.e. log x increases more slowly than any positive power of x.








3.
$$ \lim_{x \to +0} x^n \log x = \lim_{x \to +0} \frac{\log x}{\dfrac{1}{x^n}} = \lim_{x \to +0} \frac{\dfrac{1}{x}}{-\dfrac{n}{x^{n+1}}} = -\lim_{x \to +0} \frac{x^n}{n} = 0 \qquad (n > 0). $$

4. We find the limit of xˣ for x → +0. The logarithm of the expression gives
the indeterminate form 0·∞. This indeterminate form has limit 0, by Example
3, and hence:

$$ \lim_{x \to +0} x^x = 1. $$

5. We find the limit of the ratio:

$$ \lim_{x \to \infty} \frac{x + \sin x}{x}. $$

Numerator and denominator both tend to infinity. Using the rule for
replacing the ratio of the functions by the ratio of their derivatives, we get:

$$ \lim_{x \to \infty} \frac{1 + \cos x}{1}. $$

But 1 + cos x tends to no definite limit on indefinite increase of x, since
cos x always oscillates between 1 and −1; yet it is easy to see that the given
ratio itself tends to a limit:

$$ \lim_{x \to \infty} \frac{x + \sin x}{x} = \lim_{x \to \infty} \left( 1 + \frac{\sin x}{x} \right) = 1. $$

The indeterminate form has a limit here, yet the rule for finding it fails.
This result does not contradict the theorem, since the theorem states only
that, if the ratio of the derivatives tends to a limit, the ratio of the functions
tends to the same limit: the theorem does not state the converse.

6. We also note the indeterminate form (∞ − ∞). It usually leads to
the form 0/0. For example:

$$ \lim_{x \to 0} \left( \frac{1}{\sin x} - \frac{1}{x + x^2} \right) = \lim_{x \to 0} \frac{x + x^2 - \sin x}{(x + x^2) \sin x}. $$

The last expression is an indeterminate form 0/0. We evaluate
this by the method given above:

$$ \lim_{x \to 0} \left( \frac{1}{\sin x} - \frac{1}{x + x^2} \right) = 1. $$
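
The situation of Example 5 is easy to observe numerically. The sketch below evaluates the ratio of the functions and the ratio of the derivatives at large values of x; the function names and the sample points (multiples of π) are chosen here for illustration.

```python
import math


def function_ratio(x):
    return (x + math.sin(x)) / x


def derivative_ratio(x):
    return (1 + math.cos(x)) / 1


for k in (10, 100, 1000):
    even, odd = 2 * k * math.pi, (2 * k + 1) * math.pi
    # The ratio of the functions settles near 1, while the ratio of the
    # derivatives keeps swinging between values near 2 and values near 0.
    print(k, function_ratio(even), derivative_ratio(even), derivative_ratio(odd))
```

The first printed column approaches 1, whereas the other two stay near 2 and 0 however large k is taken, so the ratio of the derivatives has no limit although the ratio of the functions has one.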





§ 6. Functions of two variables


67. Basic concepts. We have so far considered functions of a single
independent variable. We now consider a function of two independent
variables

$$ u = f(x, y). $$

Values must be attributed to the independent variables, x = x₀,
y = y₀, in order to define particular values of this function. Every
such pair of values of x and y defines a corresponding point M₀
in the plane of coordinates, with coordinates (x₀, y₀), and we can
speak about the value of the function at the point M₀(x₀, y₀) of the
plane, instead of the value for x = x₀, y = y₀. The function can be
defined over the whole plane or only in some part of it, in a certain
domain. If f(x, y) is a polynomial in x, y, e.g.,

$$ u = f(x, y) = x^2 + xy + y^2 - 2x + 3y + 7, $$

the function can be considered to be defined by its formula over
the whole plane. The formula:

$$ u = \sqrt{1 - x^2 - y^2} $$

defines the function inside the circle x² + y² = 1 with centre at
the origin and radius unity, and also on the circle itself, where u = 0.
The analogue in the plane of an interval is the domain defined by
the inequalities a ≤ x ≤ b, c ≤ y ≤ d. This is a rectangle with sides
parallel to the axes, the boundaries of the rectangle being also included
in the domain. The inequalities a < x < b, c < y < d define only
interior points of the rectangle. If the boundaries of the domain are
included in it, the domain is said to be closed. If the boundaries are
not included in the domain, it is said to be open (cf. [4]). We shall
define the concept of limit for functions of two variables (cf. [32]).
We suppose that the function is defined at the point M₀(x₀, y₀) and
at all points M(x, y) sufficiently near M₀.

DEFINITION. If a function f(x, y) has the same limit A, whatever the
rule by which the point M(x, y) tends to M₀(x₀, y₀), we write:

$$ \lim_{\substack{x \to x_0 \\ y \to y_0}} f(x, y) = A $$

or

$$ \lim_{M \to M_0} f(x, y) = A. $$

This definition is equivalent to the following: for any given positive
ε there exists a positive η such that:

$$ |f(x, y) - A| < \varepsilon, \quad \text{if} \quad |x - x_0| < \eta \ \text{and} \ |y - y_0| < \eta, $$

the pair of values x = x₀ and y = y₀ being excluded. The values
x = x₀, y ≠ y₀ or x ≠ x₀, y = y₀ remain possible. If M₀ lies on the
boundary of the domain in which f(x, y) is defined, M must tend
to M₀ in such a way as to belong to this domain.
The definition of continuity follows naturally (cf. [32]).

DEFINITION. A function f(x, y) is said to be continuous at the point
M₀(x₀, y₀), if:

$$ \lim_{\substack{x \to x_0 \\ y \to y_0}} f(x, y) = f(x_0, y_0) \quad \text{or} \quad \lim_{M \to M_0} f(x, y) = f(x_0, y_0). $$

A function is said to be continuous in a certain domain if it is
continuous at every point of this domain.

The function u = √(1 − x² − y²), for example, is continuous inside
the circle in which it is defined. It can also be said of it that it
remains continuous if we associate the boundary with the circular
domain, i.e. the circumference where u = 0.

A function f(x, y), continuous in a certain domain, including its
boundary, has the following two properties, analogous to those of
a function of a single independent variable, continuous in a certain
interval [35]:

(1) there is at least one point in the domain or on the boundary
where it takes its greatest (least) value, relative to its remaining
values in the domain, including the boundary;

(2) it is uniformly continuous in the domain, including the boundary,
i.e. for any given positive ε there exists one positive η for all the
domain, such that

$$ |f(x_2, y_2) - f(x_1, y_1)| < \varepsilon \quad \text{if} \quad |x_2 - x_1| < \eta \ \text{and} \ |y_2 - y_1| < \eta, $$

where (x₁, y₁) and (x₂, y₂) are points of the domain.

We note a consequence of the definition of the continuity of a
function. If f(x, y) is continuous at the point (a, b), and if we put
y = b, the function f(x, b) of a single independent variable x is
continuous at x = a. Similarly, f(a, y) is continuous for y = b.


68. The partial derivatives and total differential of a function of two
independent variables. We suppose that y is constant, and only x
varies, in the function u = f(x, y); u becomes a function of the single
variable x and its increment and derivative can be found. Let Δₓu
denote the increment of u when y is constant and x has the
increment Δx:

$$ \Delta_x u = f(x + \Delta x, y) - f(x, y). $$

We get the derivative by finding the limit:

$$ \lim_{\Delta x \to 0} \frac{\Delta_x u}{\Delta x} = \lim_{\Delta x \to 0} \frac{f(x + \Delta x, y) - f(x, y)}{\Delta x}. $$

This derivative, obtained with y constant, is called the partial
derivative of u with respect to x, and is denoted by:

$$ \frac{\partial f(x, y)}{\partial x} \quad \text{or} \quad f'_x(x, y) \quad \text{or} \quad \frac{\partial u}{\partial x}. $$

We note that ∂u/∂x is not to be taken as a fraction, being purely
the symbol for the partial derivative. If f(x, y) has a partial derivative
with respect to x, it is a continuous function of x with y fixed.

The increment Δᵧu is defined in exactly the same way, as is the
partial derivative of u with respect to y, found by assuming that x
does not vary:

$$ \frac{\partial f(x, y)}{\partial y} = f'_y(x, y) = \frac{\partial u}{\partial y} = \lim_{\Delta y \to 0} \frac{\Delta_y u}{\Delta y} = \lim_{\Delta y \to 0} \frac{f(x, y + \Delta y) - f(x, y)}{\Delta y}. $$


If, for example,

$$ u = x^2 + y^2, \quad \text{then} \quad \frac{\partial u}{\partial x} = 2x, \quad \frac{\partial u}{\partial y} = 2y. $$

We take Clapeyron's equation:

$$ pv = RT. $$

One of the magnitudes p, v and T can be defined with the aid of this
equation in terms of the other two, these latter two being considered as
independent variables. We get the table:

Independent variables    T, p                T, v                p, v
Function                 v = RT/p            p = RT/v            T = pv/R
Partial derivatives      ∂v/∂T = R/p         ∂p/∂T = R/v         ∂T/∂p = v/R
                         ∂v/∂p = −RT/p²      ∂p/∂v = −RT/v²      ∂T/∂v = p/R

The following relationship is thus obtained:

$$ \frac{\partial v}{\partial T} \cdot \frac{\partial T}{\partial p} \cdot \frac{\partial p}{\partial v} = -\frac{RT}{pv} = -1. $$

If we were to reduce the left-hand side of this equation as a fraction, we
should get (+1) instead of (−1). But the partial derivatives in this equation
are worked out on different assumptions: ∂v/∂T on the assumption that p
is constant; ∂T/∂p on the assumption that v is constant; and ∂p/∂v with
T constant. The reduction mentioned is thus not permissible.
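
The relationship can be confirmed by approximating each partial derivative with a difference quotient. In the sketch below the numerical value of R, the chosen state (T, p) and the helper `partial` are assumptions made only for illustration; each derivative is formed with the appropriate variable held constant, exactly as in the table.

```python
R = 8.314  # assumed value of the gas constant, used only for illustration


def v_of(T, p):
    return R * T / p


def T_of(p, v):
    return p * v / R


def p_of(T, v):
    return R * T / v


def partial(f, point, index, rel_step=1e-6):
    """Central difference quotient of f in argument `index`, others fixed."""
    h = rel_step * abs(point[index])
    up, down = list(point), list(point)
    up[index] += h
    down[index] -= h
    return (f(*up) - f(*down)) / (2 * h)


T, p = 300.0, 1.0e5
v = v_of(T, p)

dv_dT = partial(v_of, (T, p), 0)   # p constant
dT_dp = partial(T_of, (p, v), 0)   # v constant
dp_dv = partial(p_of, (T, v), 1)   # T constant

product = dv_dT * dT_dp * dp_dv
print(product)  # close to -1, not +1
```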


We let Δu denote the total increment of the function, obtained
with simultaneous variation of x and y:

$$ \Delta u = f(x + \Delta x, y + \Delta y) - f(x, y). $$

Adding and subtracting f(x, y + Δy), we can write:

$$ \Delta u = [f(x + \Delta x, y + \Delta y) - f(x, y + \Delta y)] + [f(x, y + \Delta y) - f(x, y)]. $$

The first square bracket gives us the increment of u when x varies with
y fixed at (y + Δy), whilst the second square bracket gives us the
increment of the same function when y varies with x fixed. We now assume that
f(x, y) has partial derivatives over a domain that includes (x, y) as
an interior point, and we apply Lagrange's formula to each of these
increments, as can be done, since only one independent variable
changes in each case; we get:

$$ \Delta u = f'_x(x + \theta \Delta x, y + \Delta y)\, \Delta x + f'_y(x, y + \theta_1 \Delta y)\, \Delta y, $$

where θ and θ₁ lie between zero and unity. Assuming the continuity
of the partial derivatives ∂u/∂x and ∂u/∂y, we can say that the coefficient
of Δx tends to f'ₓ(x, y), and the coefficient of Δy to f'ᵧ(x, y), as Δx
and Δy tend to zero; hence,

$$ \Delta u = [f'_x(x, y) + \varepsilon]\, \Delta x + [f'_y(x, y) + \varepsilon_1]\, \Delta y $$

or

$$ \Delta u = f'_x(x, y)\, \Delta x + f'_y(x, y)\, \Delta y + \varepsilon \Delta x + \varepsilon_1 \Delta y, \qquad (1) $$

where ε and ε₁ are infinitesimals together with Δx and Δy.
This is analogous to the formula:

$$ \Delta y = y' \Delta x + \varepsilon \Delta x, $$

that we obtained for a function of a single independent variable [50].
The products εΔx and ε₁Δy are infinitesimals of higher order as
regards Δx and Δy respectively.

We recall that we based the above argument on the assumption
not only of the existence, but also of the continuity, of the partial
derivatives ∂u/∂x and ∂u/∂y over a certain domain, including (x, y)
as an interior point.

The sum of the first two terms appearing on the right of equation (1)
is called the total differential du of function u. The arbitrary increments
Δx and Δy of the independent variables coincide with their
differentials dx and dy, as in the case of a single independent variable,
so that

$$ du = f'_x(x, y)\, dx + f'_y(x, y)\, dy, $$

or

$$ du = \frac{\partial u}{\partial x}\, dx + \frac{\partial u}{\partial y}\, dy. $$

We can say, in view of the property mentioned above of the products
εΔx and ε₁Δy, that for small values of Δx and Δy the total differential
du gives the approximate magnitude of the total increment of the function
Δu. Evidently, on the other hand, the products (∂u/∂x) dx and (∂u/∂y) dy
give the approximate magnitudes of the increments Δₓu and Δᵧu,
and thus, for small increments of the independent variables, the total
increment of the function is approximately equal to the sum of its partial
increments:

$$ \Delta u \approx du \approx \Delta_x u + \Delta_y u. \qquad (2) $$

Equation (2) expresses an extremely important property of functions
of several independent variables, which may be called the "principle
of combination of small operations". Its essence lies in the fact
that the combined effect of several small operations, say Δx and Δy,
can be replaced with sufficient accuracy by the sum of the effects
of each separate small operation.
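
A small numerical experiment shows how closely the three quantities agree. The function f(x, y) = x²y, the point (1, 2) and the increments used below are assumptions chosen purely for illustration.

```python
def f(x, y):
    return x * x * y


x, y = 1.0, 2.0
dx, dy = 0.01, -0.02

delta_u = f(x + dx, y + dy) - f(x, y)        # total increment
du = (2 * x * y) * dx + (x * x) * dy         # total differential f'_x dx + f'_y dy
delta_x_u = f(x + dx, y) - f(x, y)           # partial increment, y fixed
delta_y_u = f(x, y + dy) - f(x, y)           # partial increment, x fixed

print(delta_u, du, delta_x_u + delta_y_u)
```

The three printed numbers differ only by quantities of the second order of smallness in dx and dy, which is what the approximate equality asserts.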


69. Derivatives of functions of a function and of implicit functions.
We now suppose that the function u = f(x, y) depends on a single
independent variable t through the medium of x and y, i.e. we take
x and y as functions of the independent variable t instead of being
themselves independent variables. We define the derivative du/dt
of u with respect to t.

If t receives the increment Δt, the functions x and y receive the
corresponding increments Δx and Δy, and u receives the
increment Δu:

$$ \Delta u = f(x + \Delta x, y + \Delta y) - f(x, y). $$

We saw in [68] that the increment can be written in the form:

$$ \Delta u = f'_x(x + \theta \Delta x, y + \Delta y)\, \Delta x + f'_y(x, y + \theta_1 \Delta y)\, \Delta y. $$

We divide both sides of this equation by Δt:

$$ \frac{\Delta u}{\Delta t} = f'_x(x + \theta \Delta x, y + \Delta y)\, \frac{\Delta x}{\Delta t} + f'_y(x, y + \theta_1 \Delta y)\, \frac{\Delta y}{\Delta t}. $$

We have assumed that x and y have derivatives with respect to t,
and thus are certainly continuous functions of t. Hence Δx and Δy
tend to zero as Δt tends to zero, and, assuming the continuity of
∂u/∂x and ∂u/∂y, the above equation gives us in the limit:

$$ \frac{du}{dt} = f'_x(x, y)\, \frac{dx}{dt} + f'_y(x, y)\, \frac{dy}{dt}. \qquad (3) $$


This equation gives the rule for differentiating a function of a function,
in the case of a function of more than one variable.

We suppose that, in particular, x has the role of the independent
variable t, i.e. that the function u = f(x, y) depends on the independent
variable x both directly, and via the medium of y, a function of x.
Noting that dx/dx = 1, we get from the fundamental equation (3):

$$ \frac{du}{dx} = f'_x(x, y) + f'_y(x, y)\, \frac{dy}{dx}. \qquad (4) $$

The derivative du/dx is called the total derivative of u with respect
to x, as distinct from the partial derivative f'ₓ(x, y).

The rule for differentiation just given can be used for finding the
derivative of an implicit function. Let

$$ F(x, y) = 0 \qquad (5) $$

define y as an implicit function of x, having the derivative

$$ y' = \varphi'(x). $$

Substituting y = φ(x) in (5) would necessarily give us the identity
0 = 0, since y = φ(x) is a solution of (5). We thus see that the
constant zero can be regarded as a function of a function of x, dependent
on x both directly and through the medium of y = φ(x).

The derivative with respect to x of this constant must be zero;
applying rule (4), we get:

$$ F'_x(x, y) + F'_y(x, y)\, y' = 0, $$

whence

$$ y' = -\frac{F'_x(x, y)}{F'_y(x, y)}. $$

Both x and y can enter into the expression thus obtained for y',
and if an expression has to be found for y' only in terms of the
independent variable x, it still comes to a question of solving (5) with
respect to y.
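
As a check of the formula for y', we can take an equation that is easily solved for y. The sketch below assumes, purely for illustration, the circle F(x, y) = x² + y² − 1 = 0 with its upper branch y = √(1 − x²), and compares −F'ₓ/F'ᵧ with a difference quotient of the explicit solution.

```python
import math


def F_x(x, y):
    return 2 * x       # partial derivative of x**2 + y**2 - 1 with respect to x


def F_y(x, y):
    return 2 * y       # partial derivative with respect to y


def y_explicit(x):
    return math.sqrt(1 - x * x)   # upper branch of the circle


x0 = 0.6
y0 = y_explicit(x0)

implicit_slope = -F_x(x0, y0) / F_y(x0, y0)

h = 1e-6
numeric_slope = (y_explicit(x0 + h) - y_explicit(x0 - h)) / (2 * h)

print(implicit_slope, numeric_slope)  # both close to -0.75
```

The expression −F'ₓ/F'ᵧ = −x/y still contains y, in agreement with the remark above; to express it in terms of x alone, the equation has to be solved for y.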


§ 7. Some geometrical applications 
of the differential calculus 


70. The differential of arc. It will be shown in the integral calculus
how to find the length of an arc of a curve; the expression for the
differential of the length of arc will then be investigated, and it will
be shown that the ratio of the length of the chord to the length of
the arc subtended by it tends to unity, when the arc contracts indefinitely
to a point.

Let y = f(x) be a given curve, and let the length of arc be
measured on it from some fixed point A in a specified direction
(Fig. 73). Let s be the length of arc from A to a variable point M.
The magnitude s, like the ordinate y, will be a function of
the abscissa x of M. If the direction of AM coincides with the
specified direction of the curve, s > 0, whilst in the opposite case,
s < 0. Let M(x, y) and N(x + Δx, y + Δy) be two points of the curve,
and Δs the difference of the lengths of arc AN and AM,
i.e. the increment in length of arc in passing from M to N. The
absolute value of Δs is the length of arc MN taken with the plus sign.
We have from the right-angled triangle:

$$ (MN)^2 = \Delta x^2 + \Delta y^2, $$

whence, dividing by Δx²,

$$ \left( \frac{MN}{\Delta s} \right)^2 \left( \frac{\Delta s}{\Delta x} \right)^2 = 1 + \left( \frac{\Delta y}{\Delta x} \right)^2. $$

Passing to the limit, and noting that (MN/Δs)² → 1 from what
was remarked above, we get

$$ \left( \frac{ds}{dx} \right)^2 = 1 + y'^2 $$

or

$$ \frac{ds}{dx} = \pm \sqrt{1 + y'^2}. \qquad (1) $$

We must take the positive sign if s increases as x increases, and the
negative sign if s decreases with increasing x. We suppose the former
case for clarity (illustrated in Fig. 73). It follows from (1) that

$$ ds = \sqrt{1 + y'^2}\, dx $$

or

$$ ds = \sqrt{(dx)^2 + (dy)^2}. \qquad (2) $$


The length s of arc AM is a natural parameter for defining the
position of a point M on a curve. We can take s as independent
variable, the coordinates (x, y) of a point M then being functions
of s:

$$ x = \varphi(s); \qquad y = \psi(s). $$

A more detailed discussion of the "parameters of a curve" is given
in [74]. We now explain the geometrical meaning of the derivatives
of x and y with respect to s.

We take the point N so that the direction of arc MN coincides
with the specified direction of the curve, i.e. Δs > 0. The direction
of the chord MN gives in the limit, as N tends to M, a specified
direction of the tangent at M. We take this as the positive direction
of the tangent. It is associated with the direction specified for the
curve itself.

Let α₁ be the angle between the direction of MN and the positive
direction of axis OX. The increment Δx of the abscissa x is the
projection of MN on OX, and hence:

$$ \Delta x = MN \cos \alpha_1 \qquad \left( MN = \sqrt{\Delta x^2 + \Delta y^2} \right), $$

MN being reckoned positive in this equation. On dividing both
sides of the equation by the length of arc MN, equal to Δs, we get:

$$ \frac{\Delta x}{\Delta s} = \frac{\sqrt{\Delta x^2 + \Delta y^2}}{\Delta s} \cos \alpha_1. $$

By hypothesis, Δs > 0, and hence the ratio √(Δx² + Δy²)/Δs tends
to (+1) as N tends to M, whilst α₁ tends to α, the angle between
the positive direction of the tangent MT and the positive direction
of OX. The equation just given becomes in the limit:

$$ \cos \alpha = \frac{dx}{ds}. \qquad (3) $$

Similarly, by projecting MN on OY, we get:

$$ \sin \alpha = \frac{dy}{ds}. \qquad (4) $$

71. Concavity, convexity and curvature. Curves, convex and concave
towards positive ordinates, are shown in Figs. 74 and 75.

[Figs. 74 and 75]

The same curve y = f(x) can, of course, have convex and concave
portions (Fig. 76). Points separating convex from concave portions of
a curve are called points of inflexion.

If we move along the curve in the direction of increasing x, and
observe the variation of α, the angle formed by the tangent with
the positive direction of OX, we see that (Fig. 76) α is decreasing
in convex, and increasing in concave, portions. It follows that tan α,
i.e. the derivative f'(x), will undergo the same variation, since tan α
increases (decreases) with increasing (decreasing) α. But the interval
in which f'(x) is decreasing is the interval where its derivative is
negative, i.e. f''(x) < 0; and similarly, the interval of increase of
f'(x) is that where f''(x) > 0. We thus have the theorem:

The portions of a curve are convex towards positive ordinates where
f''(x) < 0, being concave where f''(x) > 0. Points of inflexion are the
points where f''(x) changes sign.

By using arguments analogous to the earlier ones of [58], we obtain
from this theorem a rule for finding the points of inflexion of a curve:
to find the points of inflexion of a curve, the values
of x must be found for which f''(x) vanishes or does not exist, and
the variation in sign of f''(x) must be investigated on passage through
these values, using the following table:





[Fig. 76]

            Point of inflexion            Not a point of inflexion
f''(x)      + −            − +            + +            − −
Curve       conc. conv.    conv. conc.    concave        convex

The most natural way of representing the bending of a curve is
by following the variation of α, the angle made by the tangent with
OX, as we move along the curve. Given two arcs of the same length
Δs, the more curved arc will be that for which the tangent turns
through a greater angle, i.e. for which the increment Δα is greater.
This remark leads us to the concepts of the mean curvature of Δs and of the
curvature at a given point: the mean curvature
of an arc Δs is defined as the absolute value of the
ratio of Δα, the angle between the tangents at the
ends of the arc, to the length of arc Δs. The limit
of this ratio as Δs tends to zero is called the curvature of the curve
at the given point (Fig. 77).

[Fig. 77]
We thus have for the curvature C:

$$ C = \left| \frac{d\alpha}{ds} \right| . $$

But tan α is the first derivative y', i.e.

$$ \alpha = \arctan y' , $$

whence, differentiating the function of a function arctan y' with
respect to x:

$$ d\alpha = \frac{y''}{1 + y'^2}\, dx . $$

As shown above,

$$ ds = \pm \sqrt{1 + y'^2}\, dx . $$

Dividing dα by ds, we get a final expression for the curvature:

$$ C = \pm \frac{y''}{(1 + y'^2)^{3/2}} . \qquad (5) $$


The minus sign is taken in convex parts, and the plus sign in concave
parts, so as to give C a positive value.

No curvature exists at points of a curve where the derivatives y'
or y'' do not exist. A curve resembles a straight line in the
neighbourhood of points where y'', and hence the curvature, vanishes;
this will happen, for instance, near points of inflexion.
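As a quick check of the curvature formula, the sketch below evaluates it on the upper half of a circle, where the result should be the reciprocal of the radius at every point. The function name `curvature` and the chosen radius are assumptions made for the example.

```python
import math

def curvature(y1, y2):
    """Curvature |y''| / (1 + y'^2)^(3/2); the absolute value plays the role of the sign choice."""
    return abs(y2) / (1.0 + y1 * y1) ** 1.5

# Upper semicircle y = sqrt(r^2 - x^2): y' = -x/y and y'' = -r^2 / y^3.
r = 2.0
for x in (-1.0, 0.0, 1.5):
    y = math.sqrt(r * r - x * x)
    print(x, curvature(-x / y, -r * r / y ** 3))   # each value is close to 1/r
```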

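Suppose the coordinates x, y of a point of a curve are given in
terms of the length of arc s. Here, as we have seen:

$$ \cos\alpha = \frac{dx}{ds} , \qquad \sin\alpha = \frac{dy}{ds} . $$

Angle α is also a function of s, and on differentiating the above
equations with respect to s, we get:

$$ -\sin\alpha\, \frac{d\alpha}{ds} = \frac{d^2x}{ds^2} , \qquad \cos\alpha\, \frac{d\alpha}{ds} = \frac{d^2y}{ds^2} . $$

Squaring both sides of these equations and adding, we have:

$$ \left(\frac{d\alpha}{ds}\right)^2 = \left(\frac{d^2x}{ds^2}\right)^2 + \left(\frac{d^2y}{ds^2}\right)^2 , $$

whence:

$$ C^2 = \left(\frac{d^2x}{ds^2}\right)^2 + \left(\frac{d^2y}{ds^2}\right)^2 . $$

[Fig. 78, Fig. 79]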


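The reciprocal of the curvature, 1/C, is called the radius of curvature.
By (5), we have the following expression for the radius of curvature R:

$$ R = \pm \frac{ds}{d\alpha} = \pm \frac{(1 + y'^2)^{3/2}}{y''} , \qquad (6) $$

or

$$ R = \frac{1}{\sqrt{\left(\dfrac{d^2x}{ds^2}\right)^2 + \left(\dfrac{d^2y}{ds^2}\right)^2}} , $$

taking the positive value of the square root.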
In the case of a straight line, y is a polynomial of the first degree
in x, and hence y'' is identically zero, i.e. the curvature is zero
everywhere on the line, the radius of curvature being infinity.
We evidently have for a circle of radius r (Fig. 78):

$$ \Delta s = r\,\Delta\alpha \quad \text{and} \quad R = \lim \frac{\Delta s}{\Delta\alpha} = r , $$

i.e. the radius of curvature is constant for the entire circle. We shall
see later that the circle alone has this property.


We remark that the variation of the radius of curvature is by no means
as easily seen as that of the tangent. We take the curve made up of an arc BC
of a circle and a section AB of the tangent to the circle at B (Fig. 79). The
radius of curvature is infinity for the portion AB, whilst for the portion
BC it is equal to the radius of the circle r; it thus suffers a break in
continuity at B, whereas the direction of the tangent varies continuously here.
This accounts for the jolting of railway carriages at bends. Assuming a
carriage travelling with fixed speed v, we know from mechanics that a force
is exerted along the normal to the trajectory, equal to mv²/R, where m is
the mass of the moving body, and R is the radius of curvature of the
trajectory. Hence the force suffers a break in continuity at points where the
radius of curvature suffers a break in continuity, which explains the jolts.


72. Asymptotes. We now turn to considering curves with infinite
branches, where one or both of the coordinates x and y increase
indefinitely. The hyperbola and parabola are examples of such curves.

A straight line is referred to as an asymptote of a curve with an
infinite branch when the distance of points of the curve from the line
tends to zero on indefinite displacement along the infinite branch.

[Fig. 80: infinite branches with lim f(x) = +∞ and lim f(x) = −∞]

We first show how to find the asymptotes of a curve parallel to axis
OY. The equation of an asymptote of this sort must have the form:

$$ x = c , $$

where c is a constant, and x must tend to c, whilst y tends to infinity,
on moving along the corresponding infinite branch (Fig. 80). We thus
get the following rule:

All the asymptotes parallel to OY of the curve 


y = f(x) 


can be found by finding all the values x = c, on approach to which f(x) 
tends to infinity. 
To find the position of the curve relative to the asymptote, the 
sign of f(x) must be determined as x tends to c on the left and right. 
We pass to finding the asymptotes not parallel to OY. The equation
of the asymptote must now have the form:

$$ \eta = a\xi + b , $$

where ξ, η are the current coordinates of the asymptote, as distinct
from x, y, the current coordinates of the curve.
Let α be the angle that the asymptote forms with the positive
direction of OX, let MK be the distance of a point of the curve from
the asymptote, and MK₁ be the difference between the ordinates
of the curve and asymptote for the same abscissa x (Fig. 81). We have
from the right-angled triangle:

$$ |MK_1| = \frac{|MK|}{|\cos\alpha|} \qquad \left(\alpha \neq \frac{\pi}{2}\right) , $$

and hence we can replace the condition:

$$ \lim_{x \to \infty} MK = 0 $$

by the condition

$$ \lim_{x \to \infty} MK_1 = 0 . \qquad (7) $$

For an asymptote not parallel to OY, x tends to infinity on moving
along the infinite branch. Recalling that MK₁ is the difference between
the ordinates of the curve and asymptote for the same abscissa,
we can rewrite condition (7) as:

$$ \lim_{x \to \infty} [f(x) - ax - b] = 0 , \qquad (8) $$

where the values of a and b have to be found.

[Fig. 81]

We can rewrite (8) in the form:

$$ \lim_{x \to \infty} x \left[ \frac{f(x)}{x} - a - \frac{b}{x} \right] = 0 ; $$

the first factor x tends to infinity, so that the expression in square
brackets must tend to zero:

$$ \lim_{x \to \infty} \left[ \frac{f(x)}{x} - a - \frac{b}{x} \right] = \lim_{x \to \infty} \left[ \frac{f(x)}{x} - a \right] = 0 , $$

i.e.

$$ a = \lim_{x \to \infty} \frac{f(x)}{x} . $$

Having found a, we obtain b from the basic condition (8), which
can be written as:

$$ b = \lim_{x \to \infty} [f(x) - ax] . $$


Thus, a necessary and sufficient condition that the curve:

$$ y = f(x) $$

has an asymptote not parallel to OY is that, with x increasing indefinitely
on movement along the infinite branch, the limits exist:

$$ a = \lim_{x \to \infty} \frac{f(x)}{x} , \qquad b = \lim_{x \to \infty} [f(x) - ax] , $$

the equation of the asymptote then being:

$$ \eta = a\xi + b . $$

To find the position of the curve relative to its asymptote, the
cases of x tending to (+∞) and to (−∞) have to be worked out
separately, and the sign of the difference,

$$ f(x) - (ax + b) , $$

determined in each case. If the sign is positive, the curve is situated
above the asymptote, and if negative, below the asymptote. If the
difference does not keep the same sign on indefinite increase of x, the
curve will oscillate about the asymptote (Fig. 82).

[Fig. 82]
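The two limits can be watched converging numerically. The sketch below is an illustration only: the test curve y = √(x² + 2x) is an assumption chosen because its slope and intercept differ for x → +∞ and x → −∞, so the two directions really do have to be treated separately.

```python
import math

def f(x):
    # Hypothetical test curve, defined for x >= 0 and for x <= -2.
    return math.sqrt(x * x + 2.0 * x)

def ratio(x):
    """f(x)/x, whose limit is the slope a of the asymptote."""
    return f(x) / x

def offset(x, a):
    """f(x) - a*x, whose limit is the intercept b once a is known."""
    return f(x) - a * x

for x in (1e2, 1e4, 1e6):
    print(x, ratio(x), offset(x, 1.0))      # tends to a = 1, b = 1
for x in (-1e2, -1e4, -1e6):
    print(x, ratio(x), offset(x, -1.0))     # tends to a = -1, b = -1
```

For this curve the difference f(x) − (ax + b) stays negative in both directions, so both branches lie below their asymptotes.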
73. Curve-tracing. We now give a fuller indication than in [59] of
the series of operations to be carried out in tracing the curve

$$ y = f(x) . $$

We must:

(a) define the interval of variation of the independent variable x;

(b) find the points of intersection of the curve with the coordinate 
axes; 

(c) find the peaks of the curve; 

(d) find the convexities, concavities, and points of inflexion of 
the curve; 

(e) find the asymptotes of the curve; 

(f) examine the symmetry of the curve relative to the coordinate 
axes, if such symmetry exists. 

The curve can be traced more accurately if an extra series of points 
on it are plotted. The coordinates of these points can be found from 
the equation of the curve. 

1. We trace the curve

$$ y = \frac{(x - 3)^2}{4(x - 1)} . $$

(a) x can vary in the interval (−∞, +∞).
(b) Putting x = 0, we get y = −9/4; putting y = 0, we get x = 3, i.e.
the curve intercepts the axes at the points (0, −9/4) and (3, 0).
(c) We find the first and second derivatives:

$$ f'(x) = \frac{(x - 3)(x + 1)}{4(x - 1)^2} , \qquad f''(x) = \frac{2}{(x - 1)^3} . $$

We apply the usual rule to find the peaks: (3, 0) is a minimum, (−1, −2)
is a maximum.

(d) It is clear from the expression for the second derivative that it is
positive for x > 1, and negative for x < 1, i.e. the curve is concave in the
interval (1, +∞), and convex in (−∞, 1). There is no point of inflexion, since
f''(x) changes sign only at x = 1, where the curve has an asymptote
parallel to OY, as we shall see next.

(e) y becomes infinite at x = 1, and the curve has an asymptote:

$$ x = 1 . $$


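We now look for asymptotes not parallel to OY:

$$ a = \lim_{x \to \infty} \frac{(x - 3)^2}{4(x - 1)x} = \lim_{x \to \infty} \frac{\left(1 - \frac{3}{x}\right)^2}{4\left(1 - \frac{1}{x}\right)} = \frac{1}{4} , $$

$$ b = \lim_{x \to \infty} \left[ \frac{(x - 3)^2}{4(x - 1)} - \frac{x}{4} \right] = \lim_{x \to \infty} \frac{-5x + 9}{4(x - 1)} = \lim_{x \to \infty} \frac{-5 + \frac{9}{x}}{4\left(1 - \frac{1}{x}\right)} = -\frac{5}{4} . $$

The asymptote is thus:

$$ y = \frac{x}{4} - \frac{5}{4} . $$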


We propose to the reader the investigation of the position of the curve 
relative to the asymptotes. 

(f) Symmetry does not exist. 

Transferring all the data obtained to the figure, we get the curve of 
Fig. 83. 
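The data collected in this example can be spot-checked with a few lines of code. This is a sketch under the assumption that the derivatives and the oblique asymptote are as stated above; the function names are chosen here for illustration.

```python
def f(x):
    return (x - 3.0) ** 2 / (4.0 * (x - 1.0))

def f1(x):
    """First derivative found in (c)."""
    return (x - 3.0) * (x + 1.0) / (4.0 * (x - 1.0) ** 2)

def f2(x):
    """Second derivative found in (c)."""
    return 2.0 / (x - 1.0) ** 3

def gap(x):
    """Ordinate of the curve minus ordinate of the asymptote y = x/4 - 5/4."""
    return f(x) - (x / 4.0 - 5.0 / 4.0)

print(f(3.0), f1(3.0), f2(3.0))      # stationary point with f'' > 0: minimum
print(f(-1.0), f1(-1.0), f2(-1.0))   # stationary point with f'' < 0: maximum
print(gap(1e3), gap(-1e3))           # small, and opposite in sign on the two sides
```

The signs of `gap` suggest that the curve approaches the oblique asymptote from above as x → +∞ and from below as x → −∞.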

2. We investigate the curves:

$$ y = c(a^2 - x^2)(5a^2 - x^2) \qquad (c < 0) , $$

$$ y_1 = c(a^2 - x^2)^2 , $$

which give the shape of a heavy beam, bending under its own weight, the
first curve referring to the case when the ends are freely supported, and
the second to the case when the ends are constrained. The total length of the
beam is 2a, the origin is at the centre of the beam, and axis OY is directed
vertically upwards.

(a) The variation of x evidently only interests us in the interval (−a, +a).

(b) Putting x = 0, we get y = 5ca⁴ and y₁ = ca⁴, i.e. the bending at
the centre of the beam is five times greater in the first case than in the second.
For x = ±a, y = y₁ = 0, corresponding to the ends of the beam.

(c) We find the derivatives:

$$ y' = -4cx(3a^2 - x^2) , \qquad y'' = -12c(a^2 - x^2) , $$

$$ y_1' = -4cx(a^2 - x^2) , \qquad y_1'' = -4c(a^2 - 3x^2) . $$

There will be a minimum at x = 0 in the interval (−a, +a) in both cases,
corresponding to the bending of the centre of the beam, mentioned above.

[Fig. 83, Fig. 84]

(d) In the first case, y'' > 0 in the interval (−a, +a), i.e. the entire
beam is concave upwards. In the second case, y₁'' vanishes at x = ±a/√3; its
sign changes here, and the corresponding points are points of inflexion of
the beam.

(e) There are no infinite branches. 

(f) Both equations remain unchanged on replacing x by (−x), i.e. both
curves are symmetrical about OY.

The two curves are shown in Fig. 84. We have taken the cases a = 1,
c = −1, for simplicity; the length of the beam is considerably greater than
its bending, in practice, i.e. a is considerably greater than c, so that the curve
of bending looks rather different (how?).
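For the illustrative values a = 1, c = −1 the statements in (b) and (d) can be confirmed directly. The sketch below assumes the two beam equations and the second derivative of the constrained case as given above; the function names are hypothetical.

```python
import math

a, c = 1.0, -1.0   # the illustrative values used for Fig. 84

def y_free(x):
    """Beam with freely supported ends."""
    return c * (a * a - x * x) * (5.0 * a * a - x * x)

def y_fixed(x):
    """Beam with constrained ends."""
    return c * (a * a - x * x) ** 2

def y_fixed_second(x):
    """Second derivative of the constrained-ends curve."""
    return -4.0 * c * (a * a - 3.0 * x * x)

x_infl = a / math.sqrt(3.0)
print(y_free(0.0) / y_fixed(0.0))                                   # ratio of central deflections
print(y_fixed_second(x_infl - 0.1), y_fixed_second(x_infl + 0.1))   # opposite signs
```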

We suggest that the points of inflexion of the curve: 
ye 
might be found by the reader, and a comparison made with the graph of 
Fig. 60. 
74. The parameters of a curve. Given the properties of a geometrical
locus, it is not always convenient or possible to find its equation by
expressing its properties directly in terms of an equation connecting
the current coordinates x and y. It is useful in this case to introduce
a third, auxiliary variable, in terms of which the abscissa x and
ordinate y of any given point of the locus can be separately expressed.

The combination of two equations obtained in this way:

$$ x = \varphi(t) , \qquad y = \psi(t) \qquad (9) $$

can also be used for plotting and investigating a curve, since each
value of t defines the position of a corresponding point of the curve.

This method is referred to as parametric representation of a curve,
the auxiliary variable t being a parameter. To obtain the equation
of the curve in the usual (explicit or implicit) form as a relationship
between x and y, the parameter t must be eliminated from equations
(9), as might possibly be done by solving one of the equations with
respect to t, and substituting the result in the other.

Curves given by parameters are especially met with in mechanics, 
as when finding the trajectory of motion of a point, the position of 
which depends on time t, so that its coordinates are functions of t. 
The trajectory is given by a parameter, when these functions are 
known. 

For instance, the equation of a circle with centre at (x₀, y₀) and
radius r is given in terms of a parameter as:

$$ x = x_0 + r\cos t ; \qquad y = y_0 + r\sin t . \qquad (10) $$

We rewrite these equations as:

$$ x - x_0 = r\cos t ; \qquad y - y_0 = r\sin t . $$

We eliminate the parameter t by squaring both sides and adding,
which gives the ordinary equation of a circle:

$$ (x - x_0)^2 + (y - y_0)^2 = r^2 . $$

Similarly, it is immediately obvious that

$$ x = a\cos t ; \qquad y = b\sin t \qquad (11) $$

are the parametric equations of the ellipse:

$$ \frac{x^2}{a^2} + \frac{y^2}{b^2} = 1 . $$
Let y be defined parametrically by formula (9) as a function of x.

An increment Δt of the parameter produces corresponding increments
Δx and Δy, and by dividing numerator and denominator of
Δy/Δx by Δt, we find the following expression for the derivative of y
with respect to x:

$$ y'_x = \lim_{\Delta t \to 0} \frac{\Delta y}{\Delta x} = \lim_{\Delta t \to 0} \frac{\Delta y / \Delta t}{\Delta x / \Delta t} = \frac{\psi'(t)}{\varphi'(t)} , $$

or

$$ y'_x = \frac{dy}{dx} = \frac{\psi'(t)}{\varphi'(t)} . \qquad (12) $$

Using the rule for finding the differential of a fraction, we get [50]:

$$ y'' = \frac{d\left(\dfrac{dy}{dx}\right)}{dx} = \frac{d^2y\,dx - d^2x\,dy}{dx^3} . \qquad (13) $$

But by (9):

$$ dx = \varphi'(t)\,dt , \qquad d^2x = \varphi''(t)\,dt^2 , $$

$$ dy = \psi'(t)\,dt , \qquad d^2y = \psi''(t)\,dt^2 . $$

Substituting in (13) and cancelling dt³, we finally have:

$$ y'' = \frac{\psi''(t)\varphi'(t) - \varphi''(t)\psi'(t)}{[\varphi'(t)]^3} . \qquad (14) $$
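Formulae (12) and (14) translate directly into code. The sketch below applies them to the ellipse (11); the semi-axes and the parameter value are arbitrary assumptions, and the closed forms in the comments follow from eliminating the trigonometric terms by hand.

```python
import math

def dy_dx(phi1, psi1):
    """Formula (12): y' = psi'(t) / phi'(t)."""
    return psi1 / phi1

def d2y_dx2(phi1, phi2, psi1, psi2):
    """Formula (14): y'' = (psi'' phi' - phi'' psi') / phi'^3."""
    return (psi2 * phi1 - phi2 * psi1) / phi1 ** 3

# Ellipse (11): x = a cos t, y = b sin t, at a hypothetical parameter value.
a, b, t = 3.0, 2.0, 1.0
phi1, phi2 = -a * math.sin(t), -a * math.cos(t)
psi1, psi2 = b * math.cos(t), -b * math.sin(t)
print(dy_dx(phi1, psi1))                 # equals -(b/a) cot t
print(d2y_dx2(phi1, phi2, psi1, psi2))   # equals -b / (a^2 sin^3 t)
```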








We remark that the expression (13) for y'' differs from the expression
(3) of [55] for the same derivative (n = 2),

$$ y'' = \frac{d^2y}{dx^2} , \qquad (15) $$

this latter formula being obtained when x is the independent variable,
whereas t is the independent variable in the parametric form of (9).
When x is the independent variable, dx is treated as constant [50],
i.e. independent of x, so that d²x = d(dx) = 0, being the differential
of a constant. Formula (13) now reduces to (15).




Now that we can determine y' and y'', we can supply information
regarding the direction of the tangent to the curve and its convexity
and concavity, etc.


We take as an example the curve given by the equation:

$$ x^3 + y^3 - 3axy = 0 \qquad (a > 0) , \qquad (16) $$

this being known as the "folium of Descartes".
We introduce a variable parameter t, putting:

$$ y = tx , \qquad (17) $$

and we consider the points of intersection of the curve with the straight
line (17) that has variable slope t. Substituting for y from (17) in (16)
and cancelling out x², we get:

$$ x = \frac{3at}{1 + t^3} , $$

whilst (17) now gives:

$$ y = \frac{3at^2}{1 + t^3} . $$

These equations give the folium of Descartes in parametric form. We find
the derivatives of x and y with respect to t:

$$ x'_t = \frac{3a(1 - 2t^3)}{(1 + t^3)^2} , \qquad y'_t = \frac{3at(2 - t^3)}{(1 + t^3)^2} . \qquad (18) $$

We investigate the variation of x and y by dividing the total interval
(−∞, +∞) of variation of t into separate parts, within which x'_t and y'_t
preserve invariable sign and do not tend to infinity. We thus note the values

$$ t = -1 , \quad 0 , \quad \frac{1}{\sqrt[3]{2}} \quad \text{and} \quad \sqrt[3]{2} , $$

for which these derivatives vanish or become infinite. The signs of x'_t and y'_t
within these intervals are found easily from (18); on calculating x and y
at the ends of these intervals, we get the table below.

The curve corresponding to the table is shown in Fig. 85.
 Interval of t | x'_t | y'_t | x                          | y
 --------------+------+------+----------------------------+----------------------------
 (−∞, −1)      |  +   |  −   | incr. from 0 to +∞         | decr. from 0 to −∞
 (−1, 0)       |  +   |  −   | incr. from −∞ to 0         | decr. from +∞ to 0
 (0, 1/∛2)     |  +   |  +   | incr. from 0 to ∛4·a       | incr. from 0 to ∛2·a
 (1/∛2, ∛2)    |  −   |  +   | decr. from ∛4·a to ∛2·a    | incr. from ∛2·a to ∛4·a
 (∛2, +∞)      |  −   |  −   | decr. from ∛2·a to 0       | decr. from ∛4·a to 0











The slope of the tangent is given by the formula:

$$ y'_x = \frac{y'_t}{x'_t} = \frac{t(2 - t^3)}{1 - 2t^3} . \qquad (19) $$

We notice that x and y are zero for t = 0 and t = ∞, and the curve cuts
itself at the origin, as is clear from the figure.
Formula (19) gives us:

$$ y'_x = 0 \quad \text{for } t = 0 , $$

$$ y'_x = \lim_{t \to \infty} \frac{t(2 - t^3)}{1 - 2t^3} = \lim_{t \to \infty} \frac{t\left(\frac{2}{t^3} - 1\right)}{\frac{1}{t^3} - 2} = \infty \quad \text{for } t = \infty , $$

i.e. of the two branches of the curve, that intercept each other at the origin,
one touches OX, and the other OY.

As t tends to (−1), x and y tend to infinity, so that the curve has infinite
branches. We find the asymptote:

$$ \text{slope of asymptote} = \lim_{x \to \infty} \frac{y}{x} = \lim_{t \to -1} \frac{3at^2(1 + t^3)}{3at(1 + t^3)} = -1 , $$

$$ b = \lim_{t \to -1} (y + x) = \lim_{t \to -1} \frac{3at^2 + 3at}{1 + t^3} = \lim_{t \to -1} \frac{6at + 3a}{3t^2} = -a , $$

i.e. the equation of the asymptote is:

$$ y = -x - a \quad \text{or} \quad x + y + a = 0 . $$
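Both the parametrisation and the asymptote of the folium lend themselves to a numerical check. The sketch below takes a = 1 as an assumed default; `folium_point` and `residual` are names introduced here for illustration.

```python
def folium_point(t, a=1.0):
    """Parametric point of the folium x^3 + y^3 - 3axy = 0 for slope t = y/x."""
    d = 1.0 + t ** 3
    return 3.0 * a * t / d, 3.0 * a * t * t / d

def residual(t, a=1.0):
    """Left-hand side of (16) at the parametric point; it should vanish."""
    x, y = folium_point(t, a)
    return x ** 3 + y ** 3 - 3.0 * a * x * y

for t in (-3.0, -0.5, 0.5, 2.0):
    print(t, residual(t))
for t in (-1.01, -1.001, -0.999):
    x, y = folium_point(t)
    print(t, x + y + 1.0)   # x + y + a tends to 0 as t tends to -1
```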
75. Van der Waals' equation. A gas that accurately satisfies Boyle's
law and the Gay-Lussac law is known to satisfy the relation:

$$ pv = RT , $$

where T is absolute temperature, and R is a constant that is the same for all
gases, provided one "gram-molecule" is taken, i.e. the number of grams
of gas is equal to its molecular weight.

Gases do not strictly obey the above relationship in practice, and van der
Waals gave a second expression that is a good deal more accurate. This
formula reads:

$$ \left(p + \frac{a}{v^2}\right)(v - b) = RT , $$

where a and b are positive constants that differ for different gases.
Solving the equation for p gives:

$$ p = \frac{RT}{v - b} - \frac{a}{v^2} . \qquad (20) $$

We consider p as a function of v, with T constant, i.e. we take an isothermal
change of state. We find the first derivative of p with respect to v:

$$ \frac{dp}{dv} = -\frac{RT}{(v - b)^2} + \frac{2a}{v^3} = \frac{1}{(v - b)^2} \left[ \frac{2a(v - b)^2}{v^3} - RT \right] . \qquad (21) $$

[Fig. 86]
We shall only take v > b. The reader is referred to a textbook of physics
for the physical significance of this condition, as also of the curves that we
shall obtain.
Equating the derivative to zero, we get:

$$ \frac{2a(v - b)^2}{v^3} - RT = 0 . \qquad (22) $$

We investigate the variation of the left-hand side of this equation as v
varies from b to (+∞); thus we find its derivative with respect to v,
recalling that RT is constant by hypothesis:

$$ \frac{d}{dv} \left[ \frac{2a(v - b)^2}{v^3} - RT \right] = 2a\, \frac{2(v - b)v^3 - 3v^2(v - b)^2}{v^6} = -\frac{2a(v - b)(v - 3b)}{v^4} , $$
whence it is clear that the derivative is positive for b < v < 3b, and negative
for v > 3b, i.e. the left of (22) increases in the interval (b, 3b), and decreases
on further increase of v, whilst it reaches a maximum at v = 3b, equal to

$$ \frac{8a}{27b} - RT . $$

It is easily seen by direct substitution that the left of (22) gives (−RT)
for v = b and v = +∞, and hence is negative. If the maximum obtained is
also negative, i.e. if

$$ \frac{8a}{27b} - RT < 0 , $$

the left of (22) is always negative, and clearly, from (21), the derivative
dp/dv will also always be negative, i.e. p decreases with increasing v.
On the other hand, if

$$ \frac{8a}{27b} - RT > 0 , $$

the left of (22) has a positive maximum for v = 3b, and (22) has one root
v₁ in the interval (b, 3b) and another root v₂ in (3b, +∞). The left of (22),
and hence dp/dv, changes sign from minus to plus as v passes through v₁,
i.e. a minimum p corresponds to this value of v. Similarly, v = v₂ gives a
maximum p.
If, finally,

$$ \frac{8a}{27b} - RT = 0 , \qquad (23) $$

the maximum of the left of (22) is zero, and v₁, v₂ merge into the single value
v = 3b; the left of (22), and dp/dv, preserve the minus sign on passage through
this value, i.e. p is constantly decreasing with increasing v, so that v = 3b
gives the point of inflexion K of the curve. The values v = v_k, p = p_k,
corresponding to the point of inflexion, along with T = T_k as defined by (23),
are called the critical volume, critical pressure, and critical temperature of
the gas. The forms of the curve, corresponding to the three cases considered,
are shown in Fig. 86.
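The three cases can be reproduced numerically by evaluating the left-hand side of (22) at v = 3b for temperatures at, below and above the critical one. The constants in this sketch are hypothetical values in arbitrary units, and the closed form a/(27b²) for the critical pressure is obtained here by substituting v = 3b and (23) into (20); it is not stated in the text.

```python
def lhs22(v, a, b, RT):
    """Left-hand side of (22): 2a(v - b)^2 / v^3 - RT."""
    return 2.0 * a * (v - b) ** 2 / v ** 3 - RT

def pressure(v, a, b, RT):
    """Equation (20): p = RT/(v - b) - a/v^2."""
    return RT / (v - b) - a / (v * v)

# Hypothetical constants in arbitrary units; RT_crit follows from (23).
a, b = 1.0, 0.1
RT_crit = 8.0 * a / (27.0 * b)

print(lhs22(3.0 * b, a, b, RT_crit))          # zero at the critical point
print(lhs22(3.0 * b, a, b, 0.9 * RT_crit))    # positive: a maximum and a minimum of p exist
print(lhs22(3.0 * b, a, b, 1.1 * RT_crit))    # negative: p decreases monotonically
print(pressure(3.0 * b, a, b, RT_crit))       # equals a / (27 b^2)
```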


76. Singular points of curves. We take the equation of a curve in implicit
form:

$$ F(x, y) = 0 . \qquad (24) $$

The slope of the tangent to the curve is given by [69]:

$$ y' = -\frac{F'_x(x, y)}{F'_y(x, y)} , \qquad (25) $$

where x, y are the coordinates of the point of contact.

We take the particular case of F(x, y) being an integral polynomial in x
and y. The curve (24) is called algebraic in this case. The partial derivatives
F'_x(x, y) and F'_y(x, y) have fully determined values if the coordinates of a
point M of curve (24) are substituted for x and y, and (25) defines the slope
of the tangent in every case except those where the coordinates x, y of the
point cause F'_x(x, y) and F'_y(x, y) to vanish. Any such point M is called a
singular point of the curve (24).

A singular point of an algebraic curve (24) is a point, the coordinates of which
satisfy (24) and also the equations:

$$ F'_x(x, y) = 0 , \qquad F'_y(x, y) = 0 . \qquad (26) $$

In the case of the ellipse

$$ \frac{x^2}{a^2} + \frac{y^2}{b^2} = 1 , $$

condition (26) gives us x = y = 0; but (0, 0) does not lie on the ellipse, which
thus has no singular points. The same can be said of the hyperbola and
parabola.

For the folium of Descartes:

$$ x^3 + y^3 - 3axy = 0 , $$

condition (26) takes the form:

$$ 3x^2 - 3ay = 0 \quad \text{and} \quad 3y^2 - 3ax = 0 , $$


and it is immediately evident that the origin (0,0) is a singular point of the 
curve. We showed that the folium of Descartes cuts itself at the origin, the 
two intercepting branches of the curve having different tangents at this 
point: OX is the tangent to one branch, and OY the tangent to the other. 

A singular point, where different branches of a curve intercept, each branch 
having its own particular tangent, is called a node. 

For instance, the origin is a node of the folium of Descartes. 
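The defining conditions for a singular point are easy to test at a given point. The sketch below does so for the folium of Descartes with a = 1 (an assumed value); the function names and the tolerance are illustrative choices.

```python
def folium(x, y, a=1.0):
    return x ** 3 + y ** 3 - 3.0 * a * x * y

def folium_dx(x, y, a=1.0):
    """Partial derivative of the left-hand side with respect to x."""
    return 3.0 * x * x - 3.0 * a * y

def folium_dy(x, y, a=1.0):
    """Partial derivative of the left-hand side with respect to y."""
    return 3.0 * y * y - 3.0 * a * x

def is_singular(x, y, tol=1e-12):
    """True when F and both partial derivatives vanish at (x, y)."""
    return all(abs(g(x, y)) < tol for g in (folium, folium_dx, folium_dy))

print(is_singular(0.0, 0.0))   # True: the origin
print(is_singular(1.5, 1.5))   # False: on the curve (a = 1), but an ordinary point
```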

We give some further examples to show the various types of singular point
of algebraic curves.

1. We take the curve:

$$ y^2 - ax^3 = 0 \qquad (a > 0) , $$

known as a semicubical parabola. It can easily be seen that the left-hand side
of this equation, together with its partial derivatives with respect to x and
y, vanishes at (0, 0), so that the origin is a singular point of the curve. We
draw the curve, so as to study it near this singular point. The explicit form
of the equation is:

$$ y = \pm \sqrt{ax^3} . $$

It is sufficient to draw the section of the curve corresponding to the plus
sign, since the section with the minus sign is symmetrical with the first
section about OX. It is clear from the equation that x cannot be less than zero,
and that y increases from 0 to (+∞) as x increases from 0 to (+∞).

We find the first and second derivatives:

$$ y' = \pm \frac{3}{2} \sqrt{ax} , \qquad y'' = \pm \frac{3\sqrt{a}}{4\sqrt{x}} . $$

We have y' = 0 for x = 0, and on noting that x can only tend to zero
through positive values, we can say that OX is a tangent to the curve on
the right at the origin. It is also evident that y'' remains positive for the
section of the curve in question in the interval (0, +∞), i.e. this section is
concave on the side of positive ordinates.

Figure 87 shows the curve concerned, for a = 1. Two branches of the curve
approach the origin without going past it, the branches being on different sides
of the same tangent at this singular point (and always on different sides in the
present case). Such a singular point is called a cusp of the first kind.

2. We take the curve:

$$ (y - x^2)^2 - x^5 = 0 . $$

It can easily be seen that the origin is a singular point. The explicit form
of the equation is:

$$ y = x^2 \pm \sqrt{x^5} . $$

It is clear from this equation that x can vary from 0 to (+∞). We find
the first and second derivatives:

$$ y' = 2x \pm \frac{5}{2} \sqrt{x^3} , \qquad y'' = 2 \pm \frac{15}{4} \sqrt{x} , $$

and investigate separately the two branches of the curve corresponding to
the (+) and (−) signs.

We first of all remark that y' = 0 for x = 0 in both cases, so that, as in
the previous example, OX is a tangent on the right for both branches.

We get the following results on investigating in the usual manner: y increases
from 0 to (+∞) as x increases from 0 to (+∞) along the first branch, and
the curve is concave; there is a peak (maximum) in the second branch at
x = 16/25, a point of inflexion at x = 64/225, and a point of intersection
with axis OX at x = 1.

All these data result in the curve shown in Fig. 88.
Two branches of the curve meet at the origin without going past it, have the
same tangent at the origin, and are located on the same side of the tangent in
the neighbourhood of this singular point. Such a singular point is called a
cusp of the second kind.

3. We consider the curve:

$$ y^2 - x^4 + x^6 = 0 . $$

The origin is a singular point. The equation gives explicitly:

$$ y = \pm x^2 \sqrt{1 - x^2} . $$

The implicit equation contains only even powers of x and y, so that the
coordinate axes are axes of symmetry of the curve, and it is sufficient to
take only the section of the curve corresponding to positive values of x and
y. It is evident from the explicit equation that x can vary from (−1) to 1.

We find the first derivative:

$$ y' = \frac{x(2 - 3x^2)}{\sqrt{1 - x^2}} . $$

We have y = y' = 0 for x = 0, i.e. the tangent at the origin
coincides with OX; whilst for x = 1, y = 0 and y' = ∞, i.e. the
tangent at (1, 0) is parallel to OY. We find by the usual rule that the
curve has a peak at x = √(2/3).

[Fig. 89: the curve y² − x⁴ + x⁶ = 0]

The above data, including the symmetry, give us the curve of Fig. 89.
Two branches of the curve, corresponding to the plus and minus signs before
the square root, touch each other at the origin. A singular point of this sort is
called a point of osculation.

4. We consider the curve:

$$ y^2 - x^2(x - 1) = 0 . $$

The origin is a singular point. The explicit form of the curve is:

$$ y = \pm \sqrt{x^2(x - 1)} . $$

Since the expression under the square root must not be negative, we can
say that either x = 0 or x ≥ 1.

We have y = 0 for x = 0. We now consider the branch corresponding to
the plus sign. As x increases from 1 to (+∞), y increases from 0 to (+∞).

It is evident from the expression for the first derivative:

$$ y' = \frac{3x - 2}{2\sqrt{x - 1}} $$

that y' becomes infinite for x = 1, i.e. the tangent at (1, 0) is parallel to
OY. The second branch of the curve, corresponding to the minus sign, is
symmetrical with the first branch about OX. These data give us the curve
shown in Fig. 90. In this case, whilst the coordinates of point O (0, 0) satisfy
the equation of the curve, there are no other points of the curve in its vicinity.
This type of singular point is called an isolated point.

The above-mentioned types of singular point exhaust the possibilities as 
regards algebraic curves, though the coincidence of singular points of the 
same or different types is possible at certain points of algebraic curves. 

Non-algebraic curves are referred to as transcendental.

We suggest that the reader might show that the curve of

$$ y = x \log x $$

is as shown in Fig. 91. The origin is a stop point of the curve.


77. Elements of a curve. We give basic formulae connected with
the concepts of tangent and curvature of a curve, and introduce some
new concepts associated with the concept of tangent.

If the equation of the curve has the form:

$$ y = f(x) , \qquad (27) $$

the derivative f'(x) of y with respect to x is the slope of the tangent,
the equation of which can be written in the form:

$$ Y - y = y'(X - x) ; \qquad (y' = f'(x)) , \qquad (28) $$

where (x, y) are the coordinates of the point of contact, and (X, Y)
are the current coordinates of the tangent. The normal to a curve at
any point of it (x, y) is the perpendicular drawn through the point to the
tangent at the point. We know from analytic geometry that the
perpendicular to a line has a slope equal to the reciprocal with changed
sign, i.e., the slope of the normal is (−1/y'), and the equation of the
normal can be written as:

$$ Y - y = -\frac{1}{y'}(X - x) $$

or

$$ (X - x) + y'(Y - y) = 0 . \qquad (29) $$
Let M be any point of the curve, and let T and N be the points of
intersection of the tangent and normal to the curve at M with axis
OX; also, let Q be the base of the perpendicular dropped from M to OX
(Fig. 92). The segments QT and QN on OX are called respectively the
subtangent and subnormal of the curve at M; there are definite numbers
corresponding to these segments, positive or negative, depending on
their direction along OX. The lengths of segments MT and MN
are referred to respectively as the lengths of the tangent and normal
to the curve at M, these lengths always being reckoned positive. The
abscissa of Q on OX is evidently equal to the abscissa x of M. Since T and N
are the points of intersection of the tangent and normal with OX,
their abscissae must be found by setting Y = 0 in the equations of the
tangent and normal, then solving the equations obtained with respect
to X. We thus get (x − y/y') for the abscissa of T, and (x + yy') for
the abscissa of N. The magnitudes of the subtangent and subnormal
are now easily found:

$$ QT = OT - OQ = x - \frac{y}{y'} - x = -\frac{y}{y'} , $$

$$ QN = ON - OQ = x + yy' - x = yy' . \qquad (30) $$

The lengths of the tangent and normal can now be found from the
right-angled triangles MQT and MQN:

$$ |MT| = \sqrt{MQ^2 + QT^2} = \sqrt{y^2 + \frac{y^2}{y'^2}} = \pm \frac{y}{y'} \sqrt{1 + y'^2} , $$

$$ |MN| = \sqrt{MQ^2 + QN^2} = \sqrt{y^2 + y^2 y'^2} = \pm y \sqrt{1 + y'^2} , \qquad (31) $$

where the sign (±) must be chosen so that the expression on the right-hand
side is positive.
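Formulae (30) and (31) can be collected into one small routine. The sketch below is illustrative only: the function name is hypothetical, and the parabola y² = 2px is an assumed test curve, chosen because on its upper branch y' = p/y, so that the subnormal yy' should come out equal to p at every point.

```python
import math

def elements(y, y1):
    """Subtangent, subnormal, tangent length and normal length at a point.

    y is the ordinate and y1 the slope y' there (y1 must be non-zero).
    """
    subtangent = -y / y1                      # QT in (30)
    subnormal = y * y1                        # QN in (30)
    root = math.sqrt(1.0 + y1 * y1)
    return subtangent, subnormal, abs(y / y1) * root, abs(y) * root

# Hypothetical example: the parabola y^2 = 2px, upper branch, where y' = p/y.
p = 2.0
for x in (0.5, 2.0, 8.0):
    y = math.sqrt(2.0 * p * x)
    print(x, elements(y, p / y))   # the subnormal stays equal to p
```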
We recall the formula for the radius of curvature of a curve [71]:

$$ R = \pm \frac{(1 + y'^2)^{3/2}}{y''} . \qquad (32) $$

Denoting the length of the normal by n, we get from the second
of formulae (31):

$$ \sqrt{1 + y'^2} = \pm \frac{n}{y} , $$

and, on substituting this expression for √(1 + y'²) in (32), we get the
following further expression for the radius of curvature:

$$ R = \pm \frac{n^3}{y^3 y''} . \qquad (32_1) $$


If the curve is given parametrically:

$$ x = \varphi(t) , \qquad y = \psi(t) , $$

the first and second derivatives of y with respect to x are given
by [74]:

$$ y' = \frac{dy}{dx} = \frac{\psi'(t)}{\varphi'(t)} , $$

$$ y'' = \frac{d^2y\,dx - d^2x\,dy}{dx^3} = \frac{\psi''(t)\varphi'(t) - \varphi''(t)\psi'(t)}{[\varphi'(t)]^3} . \qquad (33) $$

In particular, substituting this expression in (32), we get the following
expression for the radius of curvature in this case:

$$ R = \pm \frac{(dx^2 + dy^2)^{3/2}}{d^2y\,dx - d^2x\,dy} = \pm \frac{[\varphi'^2(t) + \psi'^2(t)]^{3/2}}{\psi''(t)\varphi'(t) - \varphi''(t)\psi'(t)} = \pm \frac{ds}{d\alpha} , \qquad (34) $$

where α is the angle formed by the tangent with OX.
If the curve is given implicitly as

$$ F(x, y) = 0 , $$

we obtain the equation of the tangent from (25) as:

$$ F'_x(x, y)(X - x) + F'_y(x, y)(Y - y) = 0 . \qquad (35) $$
78. The catenary. The curve whose equation is, with suitable choice of
coordinates:

$$ y = \frac{a}{2} \left( e^{x/a} + e^{-x/a} \right) \qquad (a > 0) $$

is called the catenary. It gives the shape of a uniformly heavy string, hanging
from its two ends. It is easily drawn by the rules of [73], and its form is
shown in Fig. 93.

We find the first and second derivatives of y:

$$ y' = \frac{1}{2} \left( e^{x/a} - e^{-x/a} \right) , \qquad y'' = \frac{1}{2a} \left( e^{x/a} + e^{-x/a} \right) = \frac{y}{a^2} , $$

whence

$$ 1 + y'^2 = 1 + \frac{e^{2x/a} - 2 + e^{-2x/a}}{4} = \frac{\left( e^{x/a} + e^{-x/a} \right)^2}{4} = \frac{y^2}{a^2} . $$

Substituting this expression for (1 + y'²) in the second of formulae (31),
we get for the length of the normal to the curve:

$$ n = y \sqrt{1 + y'^2} = \frac{y^2}{a} , $$

and substituting for n and y'' in (32₁), we get:

$$ R = \frac{n^3}{y^3 y''} = \frac{y^6}{a^3} \cdot \frac{a^2}{y^4} = \frac{y^2}{a} = n , $$

i.e. the radius of curvature of the catenary is equal to the length of the normal
MN. The ordinate has its least value y = a at x = 0, and the corresponding
point A of the curve is called its vertex.

Some further auxiliary lines are shown in the figure, which are needed
by us later. The equation of the catenary is unchanged on substituting (−x)
for x, i.e. OY is an axis of symmetry of the curve.
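The equality of the radius of curvature and the length of the normal can be confirmed numerically. In the sketch below the catenary is written with hyperbolic functions, y = a cosh(x/a), which is the same curve; the value of a and the sample abscissae are assumptions.

```python
import math

def catenary(x, a):
    """Return y, y', y'' for y = (a/2)(e^{x/a} + e^{-x/a}) = a cosh(x/a)."""
    return a * math.cosh(x / a), math.sinh(x / a), math.cosh(x / a) / a

def normal_length(y, y1):
    """Length of the normal, second of formulae (31)."""
    return abs(y) * math.sqrt(1.0 + y1 * y1)

def radius_of_curvature(y1, y2):
    """Radius of curvature, formula (32), taken positive."""
    return (1.0 + y1 * y1) ** 1.5 / abs(y2)

a = 2.0   # hypothetical value of the constant a
for x in (0.0, 1.0, 3.0):
    y, y1, y2 = catenary(x, a)
    print(x, normal_length(y, y1), radius_of_curvature(y1, y2), y * y / a)
```

All three printed columns agree, and at the vertex x = 0 they reduce to a.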


79. The cycloid. We imagine a circle of radius a, rolling without slipping
along a stationary straight line. The locus traced out by any point M of
the circumference of the moving circle is called a cycloid.

We take the line on which the circle rolls as axis OX; we take as origin
the initial position of the point M when this is the point of contact of the
circle with OX, and we denote the angle of rotation of the circle by t. Further,
we call the centre of the circle C, its point of contact with OX at a given
position N; the base of the perpendicular dropped from M on to OX is called
Q, and the base of the perpendicular dropped from M on to the diameter
NN₁ of the circle is called R (Fig. 94).
Noting that, with no slipping:

$$ ON = \text{arc } NM = at , $$

we can write the coordinates of M, describing the cycloid, in terms of
parameter t = angle NCM:

$$ x = OQ = ON - QN = at - a\sin t = a(t - \sin t) , $$

$$ y = QM = NC - RC = a - a\cos t = a(1 - \cos t) . $$

This gives the cycloid in parametric form.


We remark first of all that it is sufficient to consider t varying in the 
interval (0, 2π), which corresponds to a full turn of the circle. After this 
full turn, M again coincides with the point of contact O' of the circle with 
OX, having simply shifted along OO' = 2πa. The section of the curve obtained 
with a further revolution will be exactly like arc OO', being obtained by 
moving this arc along by an amount 2πa to the right, and so on. We now 
find the first and second derivatives of x and y with respect to t: 

$$\frac{dx}{dt} = x'(t) = a(1 - \cos t), \qquad \frac{dy}{dt} = y'(t) = a\sin t,$$
$$\frac{d^2x}{dt^2} = x''(t) = a\sin t, \qquad \frac{d^2y}{dt^2} = y''(t) = a\cos t. \qquad (36)$$

By the first of formulae (33), the slope of the tangent is: 

$$y' = \frac{a\sin t}{a(1 - \cos t)} = \frac{2\sin\frac{t}{2}\cos\frac{t}{2}}{2\sin^2\frac{t}{2}} = \cot\frac{t}{2}.$$

This formula leads to a simple method of constructing the tangent to 
a cycloid. We join the point N₁ to the point M of the circle. Angle MN₁N 
is the angle subtended at the circumference by arc NM = t, and is therefore 
equal to t/2. We have from the right-angled triangle RMN₁ (Fig. 94): 

$$\angle RMN_1 = \frac{\pi}{2} - \frac{t}{2}, \qquad \tan\angle RMN_1 = \cot\frac{t}{2}.$$

On comparing this with the expression for y', we see that MN₁ is a tangent 
to the cycloid, i.e.: 

In order to construct the tangent at any point M of a cycloid, it is sufficient 
to join this point to the end N₁ of the diameter whose other end is the point of 
contact of the rolling circle with the axis OX. 

The line MN, joining M to the other end of this diameter of the circle, 
is perpendicular to MN₁, since angle N₁MN is subtended by a diameter; 
hence MN is the normal to the cycloid. The length of the normal, n = MN, 
is found directly from the right-angled triangle N₁MN: 

$$n = 2a\sin\frac{t}{2}.$$
We obtain the radius of curvature of the cycloid from (34) and (36): 

$$R = \pm\frac{[a^2(1 - \cos t)^2 + a^2\sin^2 t]^{3/2}}{a\cos t \cdot a(1 - \cos t) - a\sin t \cdot a\sin t} = \pm\frac{a(2 - 2\cos t)^{3/2}}{\cos t - 1} = \pm 2^{3/2}\, a(1 - \cos t)^{1/2} = 4a\sin\frac{t}{2}.$$


We leave only the plus sign standing in the last expression, since t lies 
in (0, 2π) for the first branch of the cycloid, and sin(t/2) cannot be negative. 

Comparing this expression with that for the length of the normal n, we 
have R = 2n, i.e. the radius of curvature of a cycloid is equal to twice the length 
of the normal (MC₁ in Fig. 94). 
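
The relation R = 2n can be confirmed numerically. The sketch below is illustrative only: the function names are hypothetical, the normal is measured as the distance from M to the point of contact N = (at, 0), and the radius of curvature is taken in the assumed parametric form (x'² + y'²)^{3/2}/|x'y'' − y'x''| with the derivatives (36).

```python
import math

def cycloid_point(t, a):
    """Point M of the cycloid x = a(t - sin t), y = a(1 - cos t)."""
    return a * (t - math.sin(t)), a * (1.0 - math.cos(t))

def cycloid_normal_length(t, a):
    # n = MN, where N = (a*t, 0) is the point of contact with OX.
    x, y = cycloid_point(t, a)
    return math.hypot(x - a * t, y)

def cycloid_radius_of_curvature(t, a):
    # Derivatives (36) inserted into the assumed parametric curvature formula.
    dx, dy = a * (1.0 - math.cos(t)), a * math.sin(t)
    d2x, d2y = a * math.sin(t), a * math.cos(t)
    return (dx * dx + dy * dy) ** 1.5 / abs(dx * d2y - dy * d2x)

if __name__ == "__main__":
    a = 1.5
    for t in (0.3, 1.0, 3.0, 6.0):
        # Columns: R, 2n and 4a sin(t/2); all three should coincide.
        print(cycloid_radius_of_curvature(t, a),
              2.0 * cycloid_normal_length(t, a),
              4.0 * a * math.sin(t / 2.0))
```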

If the point M that describes the cycloid were to lie inside or outside 
the circle, instead of on it, the corresponding curve obtained when the circle 
rolls would be a curtate or prolate cycloid (both curves are sometimes called 
trochoids). 
Let h denote the distance of M from the centre C of the rolling circle. 
The rest of the notation is the same. We first take h < a, i.e. the case of 
M lying inside the circle (Fig. 95). We have directly from the figure: 

$$x = OQ = ON - QN = at - h\sin t; \qquad y = QM = NC - RC = a - h\cos t.$$

The equations are the same for h > a, but the curve has the form shown 
in Fig. 96. 








Fig. 96 


80. Epicycloid and hypocycloid. If the circle carrying the point M rolls 
without slipping on another circle, instead of on a straight line, two general 
classes of curves are obtained: epicycloids, when the rolling circle is outside, 
and hypocycloids, when it is inside, the fixed circle. 

We find the equation of the epicycloid. We take the centre of the fixed 
circle as origin; axis OX is taken along the line joining this centre O to K, 
the initial position of point M, where the two circles originally touch each 
other. Let a be the radius of the rolling circle, and b the radius of the fixed 
circle; let the parameter t be the angle between OX and radius ON of the fixed 
circle, where N is the point of contact of the rolling circle after it has turned 
through an angle φ = ∠NCM (Fig. 97). 
Since the circle rolls without slipping, we can write: 

$$\text{arc } KN = \text{arc } NM,$$

i.e. 

$$bt = a\varphi, \qquad \varphi = \frac{bt}{a}.$$

We find directly from the figure: 

$$x = OQ = OL + LQ = OC\cos\angle KOC - CM\cos\angle SMC = (a + b)\cos t - a\cos(t + \varphi) = (a + b)\cos t - a\cos\frac{a + b}{a}t,$$
$$y = QM = LC - RC = OC\sin\angle KOC - CM\sin\angle SMC = (a + b)\sin t - a\sin(t + \varphi) = (a + b)\sin t - a\sin\frac{a + b}{a}t. \qquad (37)$$



The curve consists of a series of identical arcs, each corresponding to a 
complete revolution of the moving circle, i.e. to an increase of φ by 2π, and 
of t by 2πa/b. 

The ends of the arcs thus correspond to: 

$$t = 0, \ \frac{2\pi a}{b}, \ \frac{4\pi a}{b}, \ \ldots, \ \frac{2p\pi a}{b}, \ \ldots$$

A necessary and sufficient condition that we should eventually arrive 
at the initial point K of the curve is that one of these ends should coincide 
with K, i.e. that there should exist integers p and q satisfying 

$$\frac{2p\pi a}{b} = 2q\pi,$$

since a certain number of complete turns about O corresponds to K. 
The above condition can be written: 

$$\frac{a}{b} = \frac{q}{p}.$$

Such numbers p and q exist if, and only if, a and b are commensurable 
with each other; otherwise, a/b is irrational and cannot be equal to the 
ratio of two integers. 
It thus follows that the epicycloid represents a closed curve if, and only 
if, the radius of the moving circle is commensurable with that of the fixed 
circle; otherwise, the curve is not closed, and it is impossible ever to return 
to the starting-point K. 
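
For commensurable radii the closing condition a/b = q/p can be made concrete. The sketch below assumes the radii are given as integers or exact fractions; `closing_counts` and `epicycloid_point` are hypothetical helper names. It returns the smallest p (number of arcs) and q (number of turns about O), and the parametric equations (37) can then be evaluated at t = 2πq to confirm the return to K = (b, 0).

```python
from fractions import Fraction
import math

def closing_counts(a, b):
    """Smallest integers (p, q) with a/b = q/p, for rational radii a and b.

    p is the number of arcs (revolutions of the rolling circle) and q the
    number of complete turns about O after which the curve returns to K.
    """
    ratio = Fraction(a) / Fraction(b)
    return ratio.denominator, ratio.numerator

def epicycloid_point(t, a, b):
    """Point of the epicycloid (37) for the parameter value t."""
    k = (a + b) / a
    return ((a + b) * math.cos(t) - a * math.cos(k * t),
            (a + b) * math.sin(t) - a * math.sin(k * t))

if __name__ == "__main__":
    a, b = 2, 3
    p, q = closing_counts(a, b)
    print(p, q)                                  # arcs, turns about O
    print(epicycloid_point(2.0 * math.pi * q, a, b))  # should be close to (b, 0)
```

With a = b the function gives p = q = 1, i.e. a single arc.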
This remark also applies to the hypocycloid (Fig. 98), the equation of which 
can be obtained by simply replacing a by (−a) in the equation of the epicycloid: 

$$x = (b - a)\cos t + a\cos\frac{b - a}{a}t,$$
$$y = (b - a)\sin t - a\sin\frac{b - a}{a}t. \qquad (38)$$

We note some particular cases. Let b = a in the case of the epicycloid, 
i.e. the radii of the fixed and moving circles are equal. We get a curve 
consisting of a single branch in this case (Fig. 99), its equation being, by 
substitution of b = a in (37): 

$$x = 2a\cos t - a\cos 2t,$$
$$y = 2a\sin t - a\sin 2t.$$

This curve is called a cardioid. 
We find the distance r of point M (x, y) of this curve from the point K, with 
coordinates (a, 0); for this purpose, we write expressions for (x − a) and 
y in a more convenient form: 

$$x - a = 2a\cos t - a(\cos^2 t - \sin^2 t) - a = 2a\cos t - 2a\cos^2 t = 2a\cos t\,(1 - \cos t),$$
$$y = 2a\sin t - 2a\sin t\cos t = 2a\sin t\,(1 - \cos t),$$

whence 

$$r = |KM| = \sqrt{(x - a)^2 + y^2} = \sqrt{4a^2\cos^2 t\,(1 - \cos t)^2 + 4a^2\sin^2 t\,(1 - \cos t)^2} = 2a(1 - \cos t).$$


Clearly, (x − a) and y are the projections of KM on the x and y axes, 
whilst they are also equal, as is seen from the expressions above, to the length 
KM multiplied by cos t and sin t respectively, and hence it follows that KM 
forms an angle t with the positive direction of OX, i.e. is parallel to the 
radius ON. This result becomes important later, when giving a rule for 
constructing the tangent to a cardioid. 
We introduce angle θ = π − t, formed by KM with the negative 
direction of OX. We now have for r: 

$$r = 2a(1 + \cos\theta).$$

This is the polar equation of the cardioid; the curve will be considered 
in more detail when polar coordinates are discussed. 
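
Both statements about the chord KM, its length 2a(1 − cos t) and its inclination t to OX, can be checked directly from the parametric equations. The following is a small sketch with hypothetical function names; the direction check is valid for 0 < t < π, where `atan2` returns the angle itself.

```python
import math

def cardioid_point(t, a):
    """Point of the cardioid x = 2a cos t - a cos 2t, y = 2a sin t - a sin 2t."""
    return (2.0 * a * math.cos(t) - a * math.cos(2.0 * t),
            2.0 * a * math.sin(t) - a * math.sin(2.0 * t))

def chord_from_cusp(t, a):
    """Length and direction angle of KM, where K = (a, 0)."""
    x, y = cardioid_point(t, a)
    return math.hypot(x - a, y), math.atan2(y, x - a)

if __name__ == "__main__":
    a = 1.5
    for t in (0.4, 1.5, 2.8):
        length, angle = chord_from_cusp(t, a)
        # Compare with 2a(1 - cos t) and with the parameter t itself.
        print(length, 2.0 * a * (1.0 - math.cos(t)), angle, t)
```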
We now note some particular cases of hypocycloids. Setting b = 2a 
in (38), we get: 

$$x = 2a\cos t = b\cos t, \qquad y = 0,$$

i.e. if the radius of the fixed circle is twice the radius of the rolling circle, 
the point M moves along a diameter of the fixed circle. 

We now take b = 4a. The hypocycloid in this case consists of four branches 
(Fig. 100), being called an astroid in this particular case. With b = 4a, 
(38) now gives: 

$$x = 3a\cos t + a\cos 3t = 3a\cos t + a(4\cos^3 t - 3\cos t) = 4a\cos^3 t = b\cos^3 t,$$
$$y = 3a\sin t - a\sin 3t = 3a\sin t - a(3\sin t - 4\sin^3 t) = 4a\sin^3 t = b\sin^3 t.$$

We can eliminate the parameter t by taking the 2/3 power of each side of 
the above equations, then adding the equations term by term; this gives us 
the implicit equation of the astroid: 

$$x^{2/3} + y^{2/3} = b^{2/3}.$$
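
The two particular cases can be verified from equations (38) without using the trigonometric identities. This is a sketch with hypothetical names; the 2/3 power is taken of |x| and |y|, i.e. it is read as the real cube root of the square.

```python
import math

def hypocycloid_point(t, a, b):
    """Point of the hypocycloid (38): rolling radius a inside fixed radius b."""
    k = (b - a) / a
    return ((b - a) * math.cos(t) + a * math.cos(k * t),
            (b - a) * math.sin(t) - a * math.sin(k * t))

def astroid_residual(t, a):
    """|x|^(2/3) + |y|^(2/3) - b^(2/3) for b = 4a; it should vanish."""
    b = 4.0 * a
    x, y = hypocycloid_point(t, a, b)
    return abs(x) ** (2.0 / 3.0) + abs(y) ** (2.0 / 3.0) - b ** (2.0 / 3.0)

if __name__ == "__main__":
    for t in (0.2, 1.0, 2.5, 4.0):
        print(astroid_residual(t, 1.5))       # close to zero
        print(hypocycloid_point(t, 1.0, 2.0))  # b = 2a: y stays at zero
```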


81. Involute of a circle. This is the name of the curve described by the 
end M of a flexible string, gradually unwound from a fixed circle of radius 
a, so that it remains tangential to the circle at the point K where it leaves 
the circle (Fig. 101). 
Taking as the parameter t the angle between the positive direction of OX 
and the radius drawn to the point K, and noting that KM = arc AK = at, 
we obtain the equation of the involute of a circle in parametric form: 

$$x = \operatorname{proj}_{OX} OM = \operatorname{proj}_{OX} OK + \operatorname{proj}_{OX} KM = a\cos t + at\sin t,$$
$$y = \operatorname{proj}_{OY} OM = \operatorname{proj}_{OY} OK + \operatorname{proj}_{OY} KM = a\sin t - at\cos t.$$

We use the first of formulae (33) to find the slope of the tangent: 

$$y' = \frac{a\cos t - a\cos t + at\sin t}{-a\sin t + a\sin t + at\cos t} = \tan t.$$

The slope of the normal to the involute of a circle is thus: 

$$-\frac{1}{y'} = -\cot t = \tan\left(t - \frac{\pi}{2}\right),$$


whence it is clear that MK is the normal to the involute. This property holds 
good, as we shall see later, for the involute of any curve. 
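
The normal property can also be seen numerically: the tangent direction, approximated here by a central difference, is perpendicular to the vector KM. This is only a sketch; the step h and the function names are illustrative assumptions.

```python
import math

def involute_point(t, a):
    """Point M of the involute x = a(cos t + t sin t), y = a(sin t - t cos t)."""
    return (a * (math.cos(t) + t * math.sin(t)),
            a * (math.sin(t) - t * math.cos(t)))

def tangent_direction(t, a, h=1e-6):
    """Central-difference approximation to (dx/dt, dy/dt)."""
    x1, y1 = involute_point(t - h, a)
    x2, y2 = involute_point(t + h, a)
    return (x2 - x1) / (2.0 * h), (y2 - y1) / (2.0 * h)

def km_vector(t, a):
    """Vector from K = (a cos t, a sin t) on the circle to M on the involute."""
    x, y = involute_point(t, a)
    return x - a * math.cos(t), y - a * math.sin(t)

if __name__ == "__main__":
    t, a = 1.0, 2.0
    dx, dy = tangent_direction(t, a)
    kx, ky = km_vector(t, a)
    print(dy / dx, math.tan(t))   # slope of the tangent against tan t
    print(dx * kx + dy * ky)      # close to zero: KM is normal to the curve
```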


82. Curves in polar coordinates. The position of a point M on the 
plane (Fig. 102) is defined in polar coordinates: (1) by its distance r 
from a given point O (the pole), and (2) by the angle θ between the 
direction of OM and some given direction L (the polar axis). It is usual 
to refer to r as the radius vector and to θ as the polar angle. If the polar 
axis is taken as axis OX, and the pole O as origin, we obviously have 
(Fig. 103): 

$$x = r\cos\theta, \qquad y = r\sin\theta. \qquad (39)$$

To a given position of the point M there corresponds a single determinate 
positive value of r, but an infinite number of values of θ, differing by 
multiples of 2π. If M coincides with O, r = 0, and θ is completely indeterminate. 

Any functional relationship of the form r = f(θ) (explicit) or 
F(r, θ) = 0 (implicit) has a corresponding graph in the polar system 
of coordinates. The explicit form is more commonly encountered: 

$$r = f(\theta). \qquad (40)$$

We shall later consider negative, as well as positive, values of r, 
it being agreed to take r in the opposite direction to that corresponding 
to θ, in the case when the value of r corresponding to θ is negative. 

Assuming that r is a function of θ for a given curve, equations (39) 
are seen to represent the parametric equations of this curve, where x and 
y depend on the parameter θ both directly and through the medium of 
r. We can thus apply formulae (33) and (34) [77] in this case. On letting 
α denote the angle formed by the tangent with axis OX, we have by 
using the first of formulae (33): 

$$\tan\alpha = \frac{y'}{x'} = \frac{r'\sin\theta + r\cos\theta}{r'\cos\theta - r\sin\theta},$$

where r' denotes the derivative of r with respect to θ. 

We now further introduce μ, the angle between the positive direction 
of the radius vector and the tangent to the curve (Fig. 104). 
We have: 

$$\mu = \alpha - \theta,$$

and hence: 

$$\cos\mu = \cos\alpha\cos\theta + \sin\alpha\sin\theta,$$
$$\sin\mu = \sin\alpha\cos\theta - \cos\alpha\sin\theta.$$

Differentiating equations (39) with respect to s, and noting that 
dx/ds and dy/ds are respectively equal to cos α and sin α, we get: 

$$\cos\alpha = \cos\theta\,\frac{dr}{ds} - r\sin\theta\,\frac{d\theta}{ds}, \qquad \sin\alpha = \sin\theta\,\frac{dr}{ds} + r\cos\theta\,\frac{d\theta}{ds}.$$

Substituting these expressions for cos α and sin α in the above 
expressions for cos μ and sin μ, we have: 

$$\cos\mu = \frac{dr}{ds}, \qquad \sin\mu = r\,\frac{d\theta}{ds}, \qquad (41)$$

and hence: 

$$\tan\mu = \frac{r\,d\theta}{dr} = \frac{r}{r'}. \qquad (41_1)$$

It follows from (39): 

$$dx = \cos\theta\,dr - r\sin\theta\,d\theta,$$
$$dy = \sin\theta\,dr + r\cos\theta\,d\theta,$$

and thus: 

$$ds^2 = (dx)^2 + (dy)^2 = (dr)^2 + r^2(d\theta)^2; \qquad (42)$$


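using α = μ + θ, then dividing numerator and denominator by dθ, 
we also have: 

$$R = \pm\frac{ds}{d\alpha} = \pm\frac{[(dr)^2 + r^2(d\theta)^2]^{1/2}}{d\mu + d\theta} = \pm\frac{(r^2 + r'^2)^{1/2}}{1 + \dfrac{d\mu}{d\theta}}.$$

We get by using (41₁): 

$$\mu = \arctan\frac{r}{r'}, \qquad \frac{d\mu}{d\theta} = \frac{1}{1 + \dfrac{r^2}{r'^2}}\cdot\frac{r'^2 - rr''}{r'^2} = \frac{r'^2 - rr''}{r^2 + r'^2},$$

where r' and r'' are the first and second derivatives of r with respect 
to θ. Substituting this expression for dμ/dθ in the above expression 
for R, we have: 

$$R = \pm\frac{(r^2 + r'^2)^{3/2}}{r^2 + 2r'^2 - rr''}. \qquad (43)$$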

83. Spirals. We can distinguish three types of spiral: 


the spiral of Archimedes: r = aθ, 
the hyperbolic spiral: rθ = a, (a > 0; b > 0), 
the logarithmic spiral: r = be^{aθ}. 


The spiral of Archimedes has the form indicated in Fig. 105, with the 
dotted curve corresponding to θ < 0. Negative values of r correspond to 
negative values of θ, i.e. a negative r must be taken in the opposite direction 
to that defined by the θ in question. 

Every radius vector cuts the curve an infinite number of times, the distance 
between any two successive points of intersection being constant and equal 
to 2πa. This is evident from the fact that the direction of the radius vector 
corresponding to a given value of θ does not change if θ is increased by 
2π, 4π, ...; whilst the length of r, given by r = aθ, receives increments 
of 2πa, 4πa, ... 

The hyperbolic spiral is shown in Fig. 106. We assume θ > 0, and consider 
what happens to the curve as θ tends to 0. It is clear from the equation 

$$r = \frac{a}{\theta}$$

that r now tends to infinity. We take some point M of the curve with 
sufficiently small θ, and let MQ be the perpendicular from M to the polar axis. 
We get from the right-angled triangle MOQ (Fig. 106): 

$$QM = r\sin\theta = \frac{a\sin\theta}{\theta},$$

and as θ tends to zero: 

$$\lim_{\theta \to 0} QM = \lim_{\theta \to 0} a\,\frac{\sin\theta}{\theta} = a.$$

The distance between the point M of the curve and the polar axis thus tends 
to a as θ tends to zero, so that the curve has an asymptote LK, parallel to 
the polar axis and at a distance a from it. 

We see further that r does not vanish for any finite θ, but only tends to 
zero as θ tends to infinity. The curve thus indefinitely approaches the pole O 
whilst spiralling round it, and unlike the spiral of Archimedes, never actually 
reaches O. Such a point is in general referred to as an asymptotic point 
of a curve. 

The logarithmic spiral is shown in Fig. 107. 

We have r = b for θ = 0, whilst r tends to (+∞) for θ → (+∞); further, 
r tends to zero without vanishing as θ → (−∞). We have in this case: 

$$r' = abe^{a\theta} \quad\text{and}\quad \tan\mu = \frac{r}{r'} = \frac{1}{a},$$

i.e. the angle μ between the tangent and radius vector is constant at any 
point of the logarithmic spiral. 

84. The limaçon and cardioid. We draw a circle of diameter OA = 2a (Fig. 
108); we draw a radius vector through the point O of the circumference to cut 
the circle in a point D, and produce it further to M, such that the distance 
DM = h is constant. The locus of M is in general called a limaçon. 

Noting that: 

$$OD = 2a\cos\theta \quad\text{and}\quad OM = r,$$

the equation of the limaçon will be: 

$$r = 2a\cos\theta + h.$$

If h > 2a, this equation gives only positive values of r, and the curve 
is as shown in Fig. 109. If h < 2a, r also takes negative values, and the curve 
has the form shown in Fig. 110. The curve cuts itself at the point O. Finally, 
for h = 2a, the equation of the limaçon becomes: 

$$r = 2a(1 + \cos\theta),$$

i.e. the limaçon now becomes a cardioid [80], differing from that of [80] 
merely in its disposition (Fig. 111). We have r = 0 for θ = π, i.e. the curve 
passes through O. 
We find the first and second derivatives of r with respect to θ: 

$$r' = -2a\sin\theta, \qquad r'' = -2a\cos\theta.$$

We calculate tan μ: 

$$\tan\mu = \frac{r}{r'} = \frac{2a(1 + \cos\theta)}{-2a\sin\theta} = -\cot\frac{\theta}{2} = \tan\left(\frac{\pi}{2} + \frac{\theta}{2}\right),$$

i.e. 

$$\mu = \frac{\pi}{2} + \frac{\theta}{2}. \qquad (44)$$


As was shown previously [80], the cardioid can be obtained as the curve 
described by a point of a circle, rolling on the circle mentioned above of 
diameter OA = 2a, the diameter of the rolling circle being equal to that 
of the fixed circle. Let C be the centre of the fixed circle, M be any point 
of the cardioid, N be the point of contact of the rolling circle at the position 
corresponding to M, and NN₁ be the diameter of the moving circle (Fig. 
111). We saw above [80] that the lines OM and CN₁ are parallel†, i.e. angle 
ACN = θ, and hence: 

$$\text{arc } NM = \text{arc } ON = \pi - \theta.$$

Angle MN₁N subtended at the circumference by arc NM is equal to 
π/2 − θ/2, and finally the angle between the directions OM and N₁M is 
equal to: 

$$\pi - \left(\frac{\pi}{2} - \frac{\theta}{2}\right) = \frac{\pi}{2} + \frac{\theta}{2} = \mu,$$

whence it is clear that N₁M is the tangent to the cardioid at M. We thus 
get the following rule: 

To draw the tangent to a cardioid at any point M of it, it is sufficient to 
join this point to the end N₁ of the diameter of the rolling circle, the other end 
of which is the point of contact of the rolling with the fixed circle; the normal 
is then along the line MN. 

The rule given above for constructing the tangent to a cardioid is obtained 
very simply from kinematic considerations. It is known that, in general, 
the motion of a constrained system in a plane occurs at any given moment 
by rotation about a fixed point (the instantaneous centre), the position of 
this point in general changing with the course of time. In the case of the 
rolling shown in Fig. 111, the instantaneous centre is the point of osculation 
N of the rolling with the fixed circle, so that the velocity of the point 
M, in the direction of the tangent to the cardioid, must be perpendicular 
to NM, i.e. NM must be normal to the cardioid, whilst the perpendicular 
to it, N₁M, must be tangential to the cardioid. It follows from these remarks 
that the rule given for constructing the tangent must be generally applicable 
to the curves described by any given point of a circle which rolls without 
slipping on a fixed circle. 


† These two lines were KM and ON in [80] (Fig. 99). 

85. Cassini's ovals and the lemniscate. An oval of Cassini is the locus 
traced out by a point M, moving so that the product of its distances from two 
given points F₁ and F₂ is constant: 

$$F_1M \cdot F_2M = b^2.$$

Let the length of F₁F₂ be 2a, let F₁F₂ be directed along the polar axis, 
and let the pole O be at the mid-point of F₁F₂. 
We find from triangles OMF₁ and OMF₂ (Fig. 112): 

$$F_1M^2 = r^2 + a^2 + 2ar\cos\theta,$$
$$F_2M^2 = r^2 + a^2 - 2ar\cos\theta.$$

Substituting these expressions in the equation for an oval, then squaring 
both sides, we get after simple re-arrangement: 

$$r^4 - 2a^2r^2\cos 2\theta + a^4 - b^4 = 0,$$

whence 

$$r^2 = a^2\cos 2\theta \pm \sqrt{a^4\cos^2 2\theta - (a^4 - b^4)}.$$

The cases corresponding to a² < b² and a² > b² are shown in Fig. 112, 
the second case giving the curve consisting of two separate closed loops. 
We give a detailed discussion only of the important case when a² = b². The 
corresponding curve is called a lemniscate, its equation being: 

$$r^2 = 2a^2\cos 2\theta.$$


This equation only gives real values of r when cos 2θ ≥ 0, i.e. when θ 
lies in one of the intervals: 

$$\left(0, \frac{\pi}{4}\right), \quad \left(\frac{3\pi}{4}, \frac{5\pi}{4}\right), \quad \left(\frac{7\pi}{4}, 2\pi\right),$$

whilst r vanishes for 

$$\theta = \frac{\pi}{4}, \ \frac{3\pi}{4}, \ \frac{5\pi}{4}, \ \frac{7\pi}{4}.$$

The curve is easily constructed on the basis of these data (Fig. 113). 
The curve cuts itself at O, and the dotted lines show the tangents to the 
two branches of the curve intersecting at O. We get by differentiating both 
sides of the equation of the lemniscate with respect to θ: 

$$2rr' = -4a^2\sin 2\theta \quad\text{or}\quad r' = -\frac{2a^2\sin 2\theta}{r},$$

whence 

$$\tan\mu = \frac{r}{r'} = -\frac{r^2}{2a^2\sin 2\theta} = -\frac{2a^2\cos 2\theta}{2a^2\sin 2\theta} = -\cot 2\theta = \tan\left(2\theta + \frac{\pi}{2}\right).$$

To pass from polar to rectangular coordinates, we have from (39): 

$$r^2 = x^2 + y^2, \qquad \cos\theta = \frac{x}{r}, \qquad \sin\theta = \frac{y}{r}.$$

On writing the equation of the lemniscate in the form: 

$$r^2 = 2a^2(\cos^2\theta - \sin^2\theta)$$

and substituting the above expressions, we obtain the equation of the 
lemniscate in rectangular coordinates: 

$$x^2 + y^2 = 2a^2\,\frac{x^2 - y^2}{x^2 + y^2} \quad\text{or}\quad (x^2 + y^2)^2 = 2a^2(x^2 - y^2),$$

whence it is clear that the lemniscate is a fourth order algebraic curve. 
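
The defining property of the lemniscate and its equation in rectangular coordinates can both be checked at sample points. The sketch below is illustrative: the foci are placed at (−a, 0) and (a, 0), as follows from the choice of pole and polar axis above, and the function names are hypothetical.

```python
import math

def lemniscate_point(theta, a):
    """Cartesian point of r^2 = 2a^2 cos(2*theta); needs cos(2*theta) >= 0."""
    r = math.sqrt(2.0 * a * a * math.cos(2.0 * theta))
    return r * math.cos(theta), r * math.sin(theta)

def focal_product(x, y, a):
    """Product F1M * F2M for the foci F1 = (-a, 0) and F2 = (a, 0)."""
    return math.hypot(x + a, y) * math.hypot(x - a, y)

if __name__ == "__main__":
    a = 1.7
    for theta in (0.0, 0.3, 0.7, 2.6, 3.5):
        x, y = lemniscate_point(theta, a)
        # The product of focal distances should equal a^2 (the case b = a).
        print(focal_product(x, y, a), a * a)
        # Both sides of (x^2 + y^2)^2 = 2a^2 (x^2 - y^2).
        print((x * x + y * y) ** 2, 2.0 * a * a * (x * x - y * y))
```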
1. Find the increment in the function y = x² corresponding to the 
following changes in the argument: (a) from x = 1 to x₁ = 2; 
(b) from x = 1 to x₁ = 1.1; (c) from x = 1 to x₁ = 1 + h. 


2. Find Δy for the function y = ∛x if (a) x = 0, Δx = 0.001; 
(b) x = 8, Δx = −9; (c) x = a, Δx = h. 

3. Why is it that for the function y = 2x + 3 we can find Δy 
corresponding to an increment Δx = 5 in x, but we cannot find 
Δy for the function y = x²? 

4. Find the increment Ay and the ratio Ay/Ax for the functions: 
(a) y = (1? — 2)? at x = 1 with 4 æ = 0.4; (b) y= z at x = 0 
with Ax = —0.0001; (c) y = logio at x = 100,000 with Ax = 
= —90,000. 

5. Find Ay and 4y/4x corresponding to a change in the argument 
from x to x+ Ax: (a) y = ax + b; (b) y= 2$; (c) y — a; 
(d) y = yz; (e) y = 2; (f) y = log x. 

6. Find the slope of the chord of the parabola y = 2x − x², the 
abscissae of the end-points of which are (a) x₁ = 1, x₂ = 2; (b) 
x₁ = 1, x₂ = 0.9; (c) x₁ = 1, x₂ = 1 + h. To what limit does the 
slope of the chord tend in the last case as h → 0? 

7. The motion of a point is governed by the law s = 2t² + 3t + 5, 
where the distance s is measured in centimetres and the time t 
in seconds. What is the value of the average speed of the point 
in the time interval from t = 1 to t = 5? 

8. Find the average rise in the curve y = 2^x in the interval 
1 ≤ x ≤ 5. 

9. Find the average rise in the curve y = f(x) in the interval 
(x, x + Δx). 
10. What is understood by the statement that the slope of the curve 
y = f(x) is given at the point x? 

11. Give a definition of (a) average angular velocity, (b) instantaneous 
angular velocity. 

12. In a heated body whose temperature is changing with time how 
should we define (a) average rate of change of temperature (b) 
instantaneous rate of change of temperature ? 

13. What should we understand by rate of change of reacting matter 
in a chemical reaction ? 

14. Let m = f(x) be the non-homogeneous distribution of mass in a 
bar occupying the interval (0, x). What should be understood 
by (a) the mean linear density of matter in the interval 
(x, x + Δx); (b) the linear density of matter at the point x. 

15. Find the ratio Δy/Δx for the function y = 1/x at the point x = 2 
if (a) Δx = 1; (b) Δx = 0.1; (c) Δx = 0.01. What is the value 
of the derivative y'(2)? 

16. Find the derivative of y = tan x. 

17. Find the derivatives of the functions (a) y = 25; (b)y = lja?; 
(c) y = yz, (d) y = cot x. 


18. Calculate f'(8) if f(x) = ∛x. 
19. Find f'(0), f'(1), f'(2) if f(x) = x(x − 1)²(x − 2)³. 
20. The motion of a point is governed by s = 5t², where s is given in 
metres and t in seconds. Find the velocity of the point when t = 3. 
21. What are the values of the slopes of the curves y = 1/x and y = x² 
at the point at which they intersect? Find the angle between 
the tangents to the curves at the point of intersection. 


22. State which of the following functions do not possess finite deri- 
3 
vatives at the points stated: (a)y=Ya* at » —0, (b)y= 
3 —— 
= /a—1 stz—1; (c) y=|cosw | atthe points v = (k 4-1) a, 
Eo, +1, ES. 
Find the derivatives of the following functions (23 to 160): 
23. a5 — 4a* + 22 — 3. 24.1 — 5a - à? — Fart 


25. aa? + bx + c. 26. —5x3ja. 27. at" + btt", 
28. ax® + b. 29. zz + log 2. 30. 32?? — 2? +. 073, 


31. 2a 32. af Vz = boys. 
33. (a + bx)/(c + da). 34. (2x + 3)/(a? — 5x + 5). 35. 2/(2x — 1) —1/x. 


1 + ya/(1 — J2). 
37. b sin x -+ 3 cos x. 38. tan x — cot x. 
39. (sin x + cos z)/(sin x — cos x). 40. 2t sint — (t? — 2) cos t. 
Al, arc tan x + are cot x. 42. x cot x. 43. x arc sin z. 


44. 3 (1 + 2?) arc tana — $2 45. 2:7 e. 


46. (x — 1) e”. 47. x-?e 7. 48. a5e ". 49. e* cos x. 
50. (x? — 2x + 2) e”. 51. e* are sin x. 


52. a*lloga.53.a?log x — i a. 54. = + 2 log x — (log x)/x. 


55. (E 56. (2a + 3by). 
57. (3 + 22°)4. 58. Š (2 — 1) ^ — 2 (2 — 1) *— yy (22 — 1)7* 


59, yi—a l—za?. 60. Ya + a + ba 

61. (a? — a?» 62, tan æ — $tan?z - ¿tan? x 
63. cotz — /cota 64. 2x + 5 cos? a. 

65. cosec? t + sec? t. 66. — ¿(1 — 3 cos a)". 
67. į cos” ^z — cos 'z. 68. (2 sin z — 2 cos a) 
69. y = sin?? x + cos? x. 70. /1 + arcsinz 


71. Vare tan z — (aro sin 2)?. 


zt. 


1/2 


———Ó A e 
72. 1/arctan x. 73. Yre + x. 74. /2e* — 2* + 1 + log! x. 
75. sin? 5g cos? iz. 


76. — 2 (z— 2)? — 4(g — 1). 
77, — B (æ 3) * — L(g —8)? —1(z— 3). 
78. 1a? (1 — 23)7*. 79.01 Yaa? — 2x $ 1. 


80. 2 81. m 
a? Ya? + a? 3/(1 + z2) 


82, 2.98 41870 Ca SS y e oun 


—]1M7/4 
83.1 (1 4 x3) — (1 + a), sy (3) . 
85. x*(a — 2x*)?. 
86 ( a + ba" y 


a — bar 
87.2(z4-2) — 3 (£ + 2) + 2 (£ + 2)? — 1m 4 2)? 
88. (a + x) Va — a. 
89. V(x + a) (x + b) (x + c). 


90. y 4-y'y 91. (2t + 1) (3t + 2) (3t + 2)". 92, (ay — yp. 
93. log[/1 + e* — 1]— log [V1 4- e* + 1]. 
94. i cos? z (3 cos? æ — 5). 
95. (ante — 1) (tant x + 10 tan? z + 1) 
3 tan? x 


96. tanë 5x. 97. 3 sin (a?) 

98. sin?(t3). 99. 3 sin x cos? x + sin? x. 
100. i tan?x — tanz + 1 101. — Saas + 1 cot g. 
102. Va sin? x + B cos? æ. 103. arcsin (x?) + are cos (x°). 
104. 2 (arc sin x)? arc cosx. 105. are sin [(x? — 1)/a?]. 


106. cii sin [z/Y1 + a? + a?]. 107. (arc cos x) )/l1 — a — a? 
108. — ^ arc sin (s I] 109. Va? — a? + a arc sin (xja). 


110. x/a? — a? + a? arc sin (gja). 111. arcsin (1 — a) + \ 2a — e2. 
112, (x — 3) arc sin Vz + 4 Vz — 22, 

113. log (are sin 5x). 114. arc sin (log 2). 

115. arc tan [x sin a/(1 — x cos a)]. 

116. 5 arc tan G tan > + 5). 

117. 3b? arc tan |/z/(b — z) — (3b + 22) Vox — 2?. 

118. — 2 arctan (tan 2//2) — x. 119. /e**. 120. e*"** 

121. (2ma™ + by”. 122. e” cos B t. 


123. 45 fz — A pe ax 124. ¿e7* (3 sin 3z — cos 32). 


125. z"a^*'. 126. yeosz als, 127. get um 

128. log (ax? + bx + c). 129. log [x + Y'a? + z?]. 
130. x — 2/x + 2log (1 + Yz). 

131. log [a + x+ Vr? + 2a%]. 132. 1/log? x. 


—2)5 
133. logoos = 134. lps 


135. log LPGA 136. — ¿cosec? g + log tan 2. 
137.5 z a? — a? — 5a? log [x + ya? — a] 
138. log log (3 — 22?). 139. 51og? (ax + b). 
140. log HER +2 
Vr? -Ha — y 
141. ¿mlog (x?~ a?) o 


v—a 


a le 98 y x sta a` 
142. vsin (logz —; 1g). 149, 3 log tan = z — cosa/(2sin? z). 
144. |/z? + 1 —log[1 + yz? + 1] + log x. 

2 n 
145. soper 146, 2*:5523* .|. (1 — arc cos 3z)?. 
147, 35naxos x 1 1 sin? az[cos? ba. 
1 tan i z-4-2—y3 


148. = 
ya 98 tanizJd24 ya 





149. arc tan log x. 150. ips are sin z + $log? g = arc sin log a. 


2z—1 


151. arctanlog a: 152. 2 arc tang Tale; TI 





isi op tiur 


1 —Y/sinx 


+5 arc tan g. 





: 
154, ¿log GH + Hog Z T1 
155. 5 log(1 fe log TERN 1) + ao tan 


156. (x arcsin)//1 — a? + log /1— z2. 

157. |x |. 158. x |x |. 159. log | x | (x 4 0). 

160. f(x) = 1 — rif z < 0, f(x) =e % s 0. 

161. If f(x) = e^* find f(0) + xf'(0). 

162. If f(x) = √(1 + x) find f(3) + (x − 3) f'(3). 

163. If f(x) = tan x, Φ(x) = log (1 − x), find f'(0)/Φ'(0). 

164. If f(z) = 1 — x, (z) = 1 — sin (zz), find ®’(1)/f’(1) 

165. Prove that the derivative of an even function is an odd function, 
but that the derivative of an odd function is an even function. 

166. Prove that the derivative of a periodic function is a periodic 
function. 

167. Prove that the function y = xe^{−x} satisfies the equation 
xy' = (1 − x) y. 

168. Prove that the function y = xe^{−x²/2} satisfies the equation 
xy' = (1 − x²) y. 


169. Prove that the function y = (1 + x + log x)⁻¹ satisfies the 
equation xy' = y(y log x − 1). 
By taking logarithmic derivatives find y' when y is given by (170–179): 
170. (x + 1) (2x + 1) (32 + 1). ITL (x + 2? (x + 137 (x + 3)74. 
172. Vx(x — 1) (x — 2). 173. w[a2f(a? + 1)] ^. 


174. jz. 175.219. 176. 2%. 177. a, 

178. (cos z)?"*. 179. (1 + x 3)*. 

Find the derivative y' = dy/dx when y is defined as a function of x 
by the parametric equations 180–192: 

180. 2 == 2£ — 1, y = È. 181. x = (t + 1), y = P+ 1)7?. 

182. x = 2at/(1 + ?). y = a(1 — 8/1 +. 





183. x == 3at/(1 + 1%), y = 3aP/(1 + t). 184. x = 412, y = PB, 
185. x = YTF È, y = (t— »/yf1 +e. 

186. x = a(cos t + t sin t), y = a(sin t − t cos t). 

187. x = a cos? t, y = b sin? f. 188. x = a cos? t, y = b sin? f. 
189. x == cos? t(cos 2f) 1/?, y = sin? t(cos 20) 1/2. 

190. x = arc cos (i? + 1) 12, y = are sin t( + 1) 172. 


191. x = e^, y = e". 192, x = a(logtan}t -+ cost — sint), y = 
— a(sin f + cos t). 

193. Find dy/dx at t = 1 if x = t log t, y = t⁻¹ log t. 

194. Find dy/dx at t = π/4 if x = e^t cos t, y = e^t sin t. 

195. Prove that the function y(x) defined by the parametric equations 
x = 2t + 3t², y = t² + 2t³ satisfies the equation y = (y')² + 2(y')³. 

196. At the point x = 2 we have the relation x² = 2x. Does (x²)' = (2x)' 
at x = 2? 

197. Let y = √(a² − x²). Is it possible to differentiate the equation 
x² + y² = a² term by term? 

In 198–215 find the derivative y' = dy/dx for the implicit 
function y: 
198. 2x — 5y + 10 = 0. 199. x*%/a? + y?/b? = 1. 200. z? + y? = a?. 

201. x? 4- a?y + y? = 0. 202. /x + Vy = Va. 203. a? + y?n = 

204. (x + y) y? = x — y. 205. y — 0.3 sin y = x. 

206. a cos? (x + y) = b. 207. tan y = xy. 

208. xy = arc tan(x/y). 209. arc tan(x + y) = x. 

210. e” = x + y. 211. log x + e7”™ = c. 212. log y + aly = c. 

213. arc tan (y/x) = + log (2? + y?). 

214. Vz? + y? = c arc tan (y/x). 215, 2 = y”. 

216. Find the angles at which the curves y = sin x and y = sin 2x 
cut the x-axis. 

217. Find the angle at which the curve y = tan x cuts the x-axis. 

218. Find the angle at which the curve y = e^{0.5x} cuts the line x = 2. 

219. Find the points at which the tangent to the curve y = 3x⁴ + 
+ 4x³ − 12x² + 20 is parallel to the x-axis. 

220. Find the equation of the parabola y = x² + bx + c which touches 
the line y = x at the point (1, 1). 

221. Determine the slope of the tangent to the curve x³ + y³ − xy − 
− 7 = 0 at the point (1, 2). 

222. At what point does the curve y² = 2x³ have its tangent 
perpendicular to the straight line 4x − 3y + 2 = 0? 

223. Find the equations of the tangent and normal to the parabola 
y = √x at the point with abscissa 4. 

224. Write down the equations of the tangent and normal to the curve 
y = x³ + 2x² − 4x − 3 at the point (−2, 5). 
225. Write down the equations of the tangent and normal to the curve 
y = (x — 1)? at the point (1, 0). 

226. Write down the equation of the tangent and normal at the point 
(2, 2) to the curve x = (1 + £)/£%, y = 3t^?[2 417?/2. 

227. Write down the equations of the tangents to the curve 
x = t cos t, y = t sin t at the points t = 0 and t = π/4. 

228. Write down the equations of the tangent and normal to the curve 
z? + y + 2x — 6 = 0 at the point with ordinate y = 3. 

229. Write down the equation of the tangent to the curve
$x^5 + y^5 = 2xy$ at the point (1, 1).

230. Write down the equations of the tangents and normals at the
points where the curve $y = (x - 1)(x - 2)(x - 3)$ cuts the
x-axis.

231. Write down the equations of the tangent and normal to the curve
$y^4 = 4x^4 + 6xy$ at the point (1, 2).

232. Prove that the segment cut off by the coordinate axes on a
tangent to the hyperbola $xy = a^2$ is bisected by the point of
contact.

233. Prove that the segment cut off by the coordinate axes on a
tangent to the astroid $x^{2/3} + y^{2/3} = a^{2/3}$ has constant
magnitude $a$.

234. Prove that the normals to the involute of a circle,
$x = a(\cos t + t\sin t)$, $y = a(\sin t - t\cos t)$, are tangents
to the circle $x^2 + y^2 = a^2$.

235. Find the angle at which the parabolas $y = (x - 2)^2$ and
$y = -4 + 6x - x^2$ intersect.

236. Find the angle of intersection of the parabola $y = x^2$ and the
curve $y = x^3$.

237. Prove that the hyperbolas $xy = a^2$, $x^2 - y^2 = b^2$ intersect at
right angles.

238. For the parabola $y^2 = 4x$, calculate the lengths of the subtangent,
the subnormal, the tangent and the normal at the point (1, 2).

239. Find the length of the subtangent at an arbitrary point on the
curve $y = 2^x$.

240. Prove that the length of the normal at an arbitrary point on the
rectangular hyperbola $x^2 - y^2 = a^2$ is equal to the length of the
radius vector of that point.

241. Prove that the length of the subnormal at an arbitrary point on
the hyperbola $x^2 - y^2 = a^2$ is equal to the abscissa of that point.

242. The motion of a point along the line OX is described by the
equation $x = 3t - t^3$, $x$ being measured in cm and $t$ in sec. Find the
velocity of the point at the times $t = 0, 1, 2$ sec.

243. The motion of two points along the x-axis is described by the
equations $x = 100 + 5t$ and $x = t^2/2$ for $t \ge 0$. Find their relative
velocity when one passes the other.

244. A rod AB of length 5 cm rests so that A is on the x-axis and B
is on the y-axis and OA = 3 cm. If A begins to move along OX
with velocity 2 cm/sec, find the initial velocity of B.

245. At time $t$ a projectile fired with velocity $v_0$, in a direction making
an angle $\alpha$ with the horizontal x-direction, is at $(x, y)$ where
$x = v_0 t\cos\alpha$, $y = v_0 t\sin\alpha - gt^2/2$. Find (a) the
equation of the path of the projectile; (b) the range of the
projectile; (c) the magnitude and direction of the velocity at
time $t$.

246. A point moves along the hyperbola $y = 10/x$ in such a way that
its abscissa changes at a constant rate of 1 unit per second.
At what rate is its ordinate changing at the point (5, 2)?

247. At what point on the parabola $y^2 = 18x$ is the ordinate changing
twice as fast as the abscissa?

248. One side of a rectangle has fixed length $a = 10$ cm, while the
other one, $b$, increases at the rate 4 cm/sec. At what rate is (a) the
diagonal, (b) the area, of the rectangle increasing at the moment
when $b = 30$ cm?

249. A point moves on the spiral $r = a\theta$ ($a = 10$ cm) in such a way
that the angle $\theta$ changes at the rate of 6° per second. Find the
rate at which $r$ changes when it has the value 25 cm.

Find the second derivatives of the following functions 250—256: 

250. $x^8 + 7x^6 - 5x + 4$. 251. $e^{x^2}$. 252. $\sin^2 x$. 253. $\log\sqrt[3]{1 + x^2}$.
254. $\log\left(x + \sqrt{a^2 + x^2}\right)$. 255. $(1 + x^2)\arctan x$. 256. $(\arcsin x)^2$.


257. Show that the function $y = \frac{1}{2}x^2 + x + 1$ satisfies the differential
equation $2yy'' = 1 + y'^2$.

258. Show that the function $y = \frac{1}{2}x^2 e^x$ satisfies the differential
equation $y'' - 2y' + y = e^x$.

259. Show that for any values of the constants $c_1$ and $c_2$, the function
$y = c_1 e^{-x} + c_2 e^{-2x}$ satisfies the equation $y'' + 3y' + 2y = 0$.

260. Show that the function $y = e^{2x}\sin 5x$ satisfies the equation
$y'' - 4y' + 29y = 0$.

261. If $y = x^3 - 5x^2 + 7x - 2$, find $y'''$.

262. If $f(x) = (2x - 3)^5$, find $f'''(3)$.

263. Find y for the function y = log(1 + 2). 

264. Find y” for the function y = sin 2z. 

265. Show that the function $y = e^{-x}\cos x$ satisfies the differential
equation $y^{IV} + 4y = 0$.

266. If $f(x) = e^x \sin x$, find $f(0)$, $f'(0)$, $f''(0)$ and $f'''(0)$.

267. If the motion of a point along the x-axis is governed by the
equation $x = 100 + 5t - 0.001t^3$, find the velocity and acceleration
of the point at the instants $t = 0$, $t = 1$, $t = 10$.

268. Find the nth derivative of (ax + b)", n being a positive 
integer. 

269. Find the nth derivative of the functions (a) (1 — z)-!, 
(b) a, 

270. Find the nth derivative of the functions: (a) sin x; (b) cos 2x; 
(c) e~*; (d) log(1 + 2); (e) (14073; (f) (1. + 2)/. — 2); (g) sin? a; 
(h) log(ax + b). 

271. Using Leibniz's formula find the nth derivative of the functions: 
(a) ze”; (b) z? e”; (c) (1—2?) cos x; (d) y = (1 + 2)//2; (e) 2? log x. 

272. If $f(x) = \log[1/(1 - x)]$, find $f^{(n)}(0)$.

In Exercises 273—276 find d?y/da?: 

273. (a) x = logt, y = #; (b) x = arc tan t, y = log(1 + 13); (c) x = 
= arcsin t, y = YI —2?. 

274. (a) x = a cos t, y = a sin t; (b) x = a(t — sint), y = a(1 — cost); 
(c) x = acos?t, y — asin?t; (d) x = a(sin t — t cos t); y= 
= a(cos t + t sin t). 

275. (a) x = cos 2t, y = sin? t; (b) x =e", y = e”. 

276. x = arc tan t, y = P/2; (b) x = log t, y = (1 — £)71. 

277. Find $d^2x/dy^2$ if $x = e^t\cos t$, $y = e^t\sin t$.

278. Find $d^2y/dx^2$ for $t = 0$ if $x = \log(1 + t^2)$, $y = t^2$.

279. Show that if $y$ is defined as a function of $x$ through the parametric
equations $x = \sin t$, $y = ae^{t\sqrt{2}} + be^{-t\sqrt{2}}$, where $a$ and $b$ are
constants, it satisfies the differential equation $(1 - x^2)y'' - xy' = 2y$.

280. Find the increment $\Delta y$ and the differential $dy$ for the function
$y = 5x + x^2$ if $x = 2$ and $\Delta x = 0.001$.

281. Without calculating the derivative find d (1 — a?) if x = 1 and 
Az = — 1/8. 

282. The area $S$ of a square with side of length $x$ is given by the
formula $S = x^2$. Find the increment and differential in $S$ when $x$
takes an increment $\Delta x$ and give a geometrical interpretation of
the results.

283. Give a geometrical interpretation of the increment and differential
for the following functions: (a) area of a circle $S = \pi x^2$;
(b) volume of a cube $v = x^3$.

284. Prove that as $\Delta x \to 0$ the increment in the function $y = 2^x$
corresponding to an increment of amount $\Delta x$ in $x$ is equivalent to
$(2^x \log 2)\,\Delta x$.

285. For what value of $x$ is the differential of the function $y = x^2$
not equivalent to the increment of the function as $\Delta x \to 0$?

286. Has the function $y = |x|$ a differential at $x = 0$?

287. Making use of the derivative, find the differential of the function
$y = \cos x$ for $x = \pi/6$ and $\Delta x = \pi/36$.

289. Replacing the increment by the differential of the function,
calculate approximately: (a) $\cos 61°$; (b) $\tan 44°$; (c) $e^{0.2}$; (d) $\log 0.9$;
(e) $\arctan 1.05$.

290. Find the approximate value of the increase in the volume of a 
sphere of radius 15 cm when its radius is increased by 2 mm. 

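291. Prove that, approximately,

$$\sqrt{x + \Delta x} \approx \sqrt{x} + \frac{\Delta x}{2\sqrt{x}},$$

and use this result to find approximate values of $\sqrt{5}$, $\sqrt{17}$, $\sqrt{70}$,
$\sqrt{620}$.

292. Prove that, approximately,

$$\sqrt[3]{x + \Delta x} \approx \sqrt[3]{x} + \frac{\Delta x}{3\sqrt[3]{x^2}},$$

and find approximately the values of $\sqrt[3]{10}$, $\sqrt[3]{70}$, $\sqrt[3]{200}$.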

293. Find approximate values of the functions: (a) $x^3 - 4x^2 + 5x + 3$
when $x = 1.03$; (b) $\sqrt{1 + x}$ when $x = 0.2$; (c) $\sqrt[3]{(1 - x)/(1 + x)}$
when $x = 0.1$; (d) $e^{1 - x^2}$ when $x = 1.05$.

294. Find an approximation to the value of $\tan 45°3'20''$.

295. Show that on the intervals $-1 \le x \le 0$ and $0 \le x \le 1$, the
function $f(x) = x - x^3$ satisfies Rolle's theorem. Find the
corresponding value of the mean value $\xi$ in Rolle's formula.

296. The function $f(x) = \sqrt[3]{(x - 2)^2}$ assumes the equal values
$f(0) = f(4) = \sqrt[3]{4}$ at the end-points of the interval [0, 4]. Does this
function satisfy the conditions of Rolle's theorem in [0, 4]?

297. Does the function $f(x) = \tan x$ satisfy the conditions of Rolle's
theorem in $[0, \pi]$?

298. If $f(x) = x(x + 1)(x + 2)(x + 3)$ show that the equation
$f'(x) = 0$ has three real roots.

299. The equation $e^x = 1 + x$ obviously has the root $x = 0$; show
that it has no other real root.

Evaluate the limits 300 —311: 


m : l-g 
300. lim 2.95.7 PA $01, lim ==*-... 
+. " x1 (l— sing 72) 
im * — si : Ze 
302. lim “22. 303. lim 2 ee 
xo 'U—SIn x 1 1 + cos 4x” 
X— — n 


4 


304. lim (1 — x) tan (s ma). 305. lim cot z X arc sin x. 





x1 x0 
306. lim x" e-*, (n > 0). 307. nm x" sin (ajx), n > 0. 
x0 
308. lim log zxlog (z— 1) 309. lim 7 sates 
x1 Wu 
1 5 P 
310. lim E E E 311. (a) lim (x? + y?) sin (1/zy) ; 
29 
à sin xy | 
O imati A 
you y2 
i qc], EE UNDE 
Ome Om O IE 
yk y>0 y>0 


312. Investigate the continuity of the function

$$f(x, y) = \begin{cases} \sqrt{1 - x^2 - y^2} & \text{if } x^2 + y^2 \le 1, \\ 0 & \text{if } x^2 + y^2 > 1. \end{cases}$$


313. Find the points of discontinuity of the following functions:
(a) $\log\sqrt{x^2 + y^2}$; (b) $(x - y)^{-2}$; (c) $(1 - x^2 - y^2)^{-1}$; (d) $\cos(1/xy)$.

Find the first derivatives $\partial z/\partial x$ and $\partial z/\partial y$ of the functions 314-324:

314. $z = x^3 + y^3 - 3axy$. 315. $z = (x - y)/(x + y)$.

316. $z = y/x$. 317. $z = \sqrt{x^2 - y^2}$.

318. $z = x/\sqrt{x^2 + y^2}$. 319. $z = \log\left[x + \sqrt{x^2 + y^2}\right]$. 320. $z = \arctan(y/x)$.

321. $z = x^y$. 322. $z = \exp(\sin(y/x))$.

323. $z = \arcsin\sqrt{(x^2 - y^2)/(x^2 + y^2)}$. 324. $z = \log\sin[(x + a)y^{-1/2}]$.

325. If u = w”, find the derivatives uy, uy. 

326. If u = log(1 + ay) find ux, w. 

327. Find $\partial f/\partial x$, $\partial f/\partial y$ at (2, 1) if $f(x, y) = \sqrt{xy + x/y}$.

328. Find $\partial f/\partial x$, $\partial f/\partial y$ at the point (1, 2) for the function
$f(x, y) = \log(xy)$.

329. For the function $f(x, y) = xy$ find the total increment $\Delta f$ and
the total differential $df$ at the point (1, 2). Calculate $\Delta f - df$
if (a) $\Delta x = 1$, $\Delta y = 2$; (b) $\Delta x = 0.1$, $\Delta y = 0.2$.

330. For any pair of functions $u$, $v$, show that (a) $d(u + v) = du + dv$;
(b) $d(uv) = u\,dv + v\,du$; (c) $d(u/v) = (v\,du - u\,dv)/v^2$.

Find the total differential of the following functions: 

331. $z = x^3 + y^3 - 3xy$. 332. $z = x^2y^3$. 333. $z = (x^2 - y^2)/(x^2 + y^2)$.

334. $z = \sin^2 x + \cos^2 y$. 335. $z = yx^y$. 336. $z = \log(x^2 + y^2)$.

337. $f = \log(1 + x/y)$.

338. One side of a rectangle is $a = 10$ cm, the other is $b = 24$ cm. Find
the approximate change in the length of the diagonal $l$ (by
calculating $dl$) when $a$ is increased by 4 mm and $b$ is decreased by
1 mm. Calculate the exact value $\Delta l$.

339. Calculate approximately: (a) $(1.02)^3 \times (0.97)^2$;
(b) $\sqrt{(4.05)^2 + (2.93)^2}$; (c) $\sin 32° \times \cos 59°$.

340. The period of a simple pendulum is given by the formula
$T = 2\pi\sqrt{l/g}$. Find the change in $T$ if there are changes $\alpha$ and $\beta$
in $l$ and $g$ respectively.

341. Find $dz/dt$ if $z = x/y$ where $x = e^t$, $y = \log t$.

342. Find $du/dt$ if $u = \log\sin(xy^{-1/2})$ where $x = 3t^2$, $y = \sqrt{1 + t^2}$.

343. Find $du/dt$ if $u = xyz$ where $x = 1 + t^2$, $y = \log t$, $z = \tan t$.

344. Find $du/dt$ if $u = (x^2 + y^2)^{-1/2} z$, where $x = R\cos t$, $y = R\sin t$,
$z = H$.

345. Find $dz/dx$ if $z = u^v$, where $u = \sin x$, $v = \cos x$.





CHAPTER III 


INTEGRATION: THEORY AND APPLICATIONS 


§ 8. Basic problems of the integral calculus. 
The indefinite integral 


86. The concept of an indefinite integral. One of the basic tasks of 
the differential calculus is to find the derivative or differential of a 
given function. 

The primary task of the integral calculus consists in the converse — 
finding the function, given its derivative or differential. 

Let the derivative

$$y' = f(x)$$

or differential

$$dy = f(x)\,dx$$

be given of the unknown function $y$.
A function $F(x)$, possessing a given function $f(x)$ as its derivative, or
$f(x)\,dx$ as its differential, is called a primitive of the given function $f(x)$.
If, for example,

$$f(x) = x^2,$$

a primitive of the function will be $F(x) = \frac{1}{3}x^3$; we have, in fact,

$$\left(\frac{1}{3}x^3\right)' = \frac{1}{3}\times 3x^2 = x^2.$$

Suppose that a primitive $F(x)$ of the given function $f(x)$ has been
found, so that we have the relationship

$$F'(x) = f(x).$$

Since the derivative of an arbitrary constant $C$ is equal to zero,
we also have:

$$[F(x) + C]' = F'(x) = f(x),$$

i.e. the function $F(x) + C$ is also a primitive of $f(x)$.

Hence it follows that, if the problem of finding a primitive has
one solution, it will have an infinity of further solutions, differing
from the first by an arbitrary constant. On the other hand, it can
be shown that there are no other solutions apart from these, i.e.
if $F(x)$ is any one primitive of a given function $f(x)$, any other
primitive has the form

$$F(x) + C,$$

where $C$ is an arbitrary constant.
Let $F_1(x)$ be any function whose derivative is $f(x)$. We have:

$$F_1'(x) = f(x).$$

On the other hand, $F(x)$ possesses the derivative $f(x)$, i.e.

$$F'(x) = f(x).$$

Subtracting this equation from the previous one, we get:

$$F_1'(x) - F'(x) = [F_1(x) - F(x)]' = 0,$$

whence, by the theorem proved in [63]:

$$F_1(x) - F(x) = C,$$

where $C$ is a constant, which it was required to prove.

The result we have obtained can also be formulated as: If the deriva- 
tives (or differentials) of two functions are identically equal, the functions 
themselves differ only by a constant. 

The most general expression for a primitive is also referred to as the 
indefinite integral of the given function f(x), or of the given differential 
f(x)dx, and is denoted by the symbol 


$$\int f(x)\,dx,$$

$f(x)$ being referred to as the integrand, and $f(x)\,dx$ as the integrand
expression.

Having found one primitive F(x), we can write, by what was 
shown above: 

$$\int f(x)\,dx = F(x) + C,$$

where C is an arbitrary constant. 
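
The statement that every function F(x) + C is a primitive of f(x) can be checked numerically. The following is only a minimal sketch for the example f(x) = x² used above; the helper names, the sample points and the central-difference step h are our own illustrative assumptions and do not come from the text.

```python
def f(x):
    """Integrand from the text's example: f(x) = x^2."""
    return x * x


def F(x, C=0.0):
    """Candidate primitive F(x) = x^3/3 + C, with an arbitrary constant C."""
    return x ** 3 / 3.0 + C


def derivative(g, x, h=1e-5):
    """Central-difference estimate of g'(x) (a numerical approximation only)."""
    return (g(x + h) - g(x - h)) / (2.0 * h)


def max_deviation(C, points=(-2.0, -0.5, 0.0, 1.0, 3.0)):
    """Largest |[F(x) + C]' - f(x)| over a few sample points."""
    return max(abs(derivative(lambda x: F(x, C), p) - f(p)) for p in points)
```

Whatever value is given to C, `max_deviation(C)` stays at the level of the numerical error of the difference quotient, in agreement with [F(x) + C]' = f(x).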

Mechanical and geometrical interpretations can be given of the 
indefinite integral. Suppose we have a law giving an analytic relation- 
ship between velocity and time: 


$$v = f(t),$$

and we want to express the path $s$ in terms of time. Since the velocity
of a point in a given trajectory is the derivative $ds/dt$ of the path
with respect to time, the problem reduces to finding a primitive of
the function $f(t)$, i.e.

$$s = \int f(t)\,dt.$$


We get an infinite number of solutions, differing by a constant
term. This lack of precision in the answer results from our not fixing
the point from which the traversed path $s$ is measured. If, for
instance, $v = gt + v_0$ (uniformly accelerated motion), we obtain
the expression for $s$:

$$s = \frac{1}{2}gt^2 + v_0 t + C, \tag{1}$$

because, as is easily shown, the derivative of (1) with respect to $t$
coincides with the given expression $v = gt + v_0$.
If we agree to measure $s$ from the point corresponding to $t = 0$,
i.e. if we agree to take $s = 0$ at $t = 0$, we have to put the constant
$C = 0$ in (1). Of course it is of no significance that we have denoted
the independent variable by $t$ in the above discussion, and not by $x$.
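
As an illustration of formula (1), the short sketch below differentiates the path numerically and compares the result with the given velocity. The numerical values of g and v0 and the function names are assumptions chosen purely for the example.

```python
def velocity(t, g=9.81, v0=2.0):
    """Given law v = g*t + v0 (uniformly accelerated motion)."""
    return g * t + v0


def path(t, g=9.81, v0=2.0, C=0.0):
    """Formula (1): s = g*t^2/2 + v0*t + C."""
    return 0.5 * g * t * t + v0 * t + C


def numerical_velocity(t, C, h=1e-5):
    """Central-difference estimate of ds/dt for the path with constant C."""
    return (path(t + h, C=C) - path(t - h, C=C)) / (2.0 * h)
```

The estimate `numerical_velocity(t, C)` agrees with `velocity(t)` for any C, while `path(0.0, C=C)` equals C, so that measuring s from the position at t = 0 amounts to taking C = 0.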

We now pass to the geometrical interpretation of the problem of
finding a primitive. The relationship $y' = f(x)$ shows that the graph
of any required primitive, or, as we usually say, of any integral
curve:

$$y = F(x),$$

is such that the tangent to the curve for any given $x$ has the
direction determined by the slope

$$y' = f(x). \tag{2}$$

In other words, the direction of the tangent to the curve is given
by (2) for any given value of the independent variable $x$; the problem
is to find this curve. Having constructed one such integral curve, all
the curves obtained by moving this by any amount in a direction parallel
to the axis OY will have parallel tangents with the same slope
$y' = f(x)$ as in the case of the initial curve, given the same value
of $x$ (Fig. 114). The parallel shift referred to is equivalent to adding
a constant $C$ to the ordinate of the curve; and the general equation
of the curves, giving solutions of the problem, will be:

$$y = F(x) + C. \tag{3}$$

In order to define fully the position of a curve, i.e. to fully define
the expression for the primitive, a point must be assigned through
which the integral curve must pass. The point assigned may be the
point of intersection of the curve with a line

$$x = x_0$$

parallel to the axis OY. This is equivalent to assigning the initial value $y_0$
that the required function $y = F(x)$ takes for the given value $x = x_0$.
We substitute this initial value in equation (3), and obtain an equation
defining the arbitrary constant $C$:

$$y_0 = F(x_0) + C,$$

so that finally, the primitive satisfying our initial condition will
have the form:

$$y = F(x) + [y_0 - F(x_0)].$$
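
The choice of the constant from an initial condition can be sketched in a few lines. The helper name `primitive_through` is hypothetical and F is assumed to be any known primitive supplied by the reader.

```python
def primitive_through(F, x0, y0):
    """Return the primitive y = F(x) + [y0 - F(x0)] passing through (x0, y0).

    F is assumed to be any one primitive of the given function.
    """
    C = y0 - F(x0)  # constant determined from y0 = F(x0) + C
    return lambda x: F(x) + C
```

For instance, with F(x) = x³/3 and the point (1, 2), the returned function takes the value 2 at x = 1 and differs from F only by the constant 2 - 1/3.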


Before examining the properties of the indefinite integral, and 
methods for finding the primitive, we note a second basic problem 
of the integral calculus, and examine it from the point of view of 
the problem already stated — namely, that of finding the primitive. 
A new concept is essential for what follows, this being the concept 
of a definite integral. We choose a natural approach here by starting 
from the intuitive idea of area, which also enables us to examine the 
connection between the concepts of definite integral and primitive. 
The discussion of the next two articles, based as it is on the intuitive
idea of area, cannot be considered as a rigorous proof of new facts. 
A logically rigorous approach to the fundamentals of the integral 
calculus is indicated at the end of [88], whilst a full discussion is 
given at the end of the present chapter. 


87. The definite integral as the limit of a sum. We take the graph 
of the function f(x) in the plane XOY, and assume that it consists 
of a continuous curve, lying wholly above OX, i.e. all ordinates of 
the graph are assumed positive. We consider the area $S_{ab}$ bounded
by OX, the curve, and the two ordinates $x = a$ and $x = b$ (Fig. 115),
and we try to find the magnitude of this area. We start by dividing
the interval $(a, b)$ into $n$ parts by means of the points:

$$a = x_0 < x_1 < x_2 < \ldots < x_{k-1} < x_k < \ldots < x_{n-1} < x_n = b.$$

The area $S_{ab}$ is now divided into $n$ vertical strips, the length of
base of the $k$th strip being $(x_k - x_{k-1})$. Let $m_k$ and $M_k$ respectively
denote the least and greatest values of function $f(x)$ in the interval
$(x_{k-1}, x_k)$, i.e. the least and greatest ordinates of our graph in this
interval. The area of the strip lies between the areas of the two rectangles
of heights $m_k$ and $M_k$ and having the common base $(x_k - x_{k-1})$
(Fig. 116). These rectangles are the "interior" and "exterior" rectangles
for the $k$th strip. The magnitude of the area of the $k$th strip is
thus comprised between the areas of the rectangles in question, i.e.
between the two numbers:

$$m_k (x_k - x_{k-1}) \quad\text{and}\quad M_k (x_k - x_{k-1}),$$

and hence the total area $S_{ab}$ will lie between the sums of the areas of
these interior and exterior rectangles, i.e. $S_{ab}$ lies between the sums:

$$s_n = m_1 (x_1 - x_0) + m_2 (x_2 - x_1) + \ldots + m_k (x_k - x_{k-1}) + \ldots + m_{n-1} (x_{n-1} - x_{n-2}) + m_n (x_n - x_{n-1}), \tag{4}$$

$$S_n = M_1 (x_1 - x_0) + M_2 (x_2 - x_1) + \ldots + M_k (x_k - x_{k-1}) + \ldots + M_{n-1} (x_{n-1} - x_{n-2}) + M_n (x_n - x_{n-1}).$$

We thus have the inequality:

$$s_n \le S_{ab} \le S_n. \tag{5}$$

Fig. 115        Fig. 116


We now draw a mean rectangle in place of the interior and exterior
rectangles for each strip, taking base $(x_k - x_{k-1})$ as usual but with
the height taken as the ordinate $f(\xi_k)$ of our curve at any given point
$\xi_k$ of the interval $(x_{k-1}, x_k)$ (Fig. 117). We consider the sum of the
areas of these mean rectangles:

$$S_n' = f(\xi_1)(x_1 - x_0) + f(\xi_2)(x_2 - x_1) + \ldots + f(\xi_k)(x_k - x_{k-1}) + \ldots + f(\xi_{n-1})(x_{n-1} - x_{n-2}) + f(\xi_n)(x_n - x_{n-1}). \tag{6}$$

This, like the area $S_{ab}$, will lie between the sums of the areas of
the interior and exterior rectangles, i.e. we have

$$s_n \le S_n' \le S_n. \tag{7}$$


We now indefinitely increase the number $n$ of divisions of the interval
$(a, b)$, and so also make the greatest of the differences $(x_k - x_{k-1})$
tend to zero. Since $f(x)$ is continuous by hypothesis, the
difference $(M_k - m_k)$ between its greatest and least values in the
interval $(x_{k-1}, x_k)$ will tend to zero with indefinite decrease of the
length of this interval, irrespective of its position in the fundamental
interval $(a, b)$ (see the property of a continuous function in [35]).
We thus have $\varepsilon_n$ tending to zero on passing to the above-mentioned
limit, where $\varepsilon_n$ is the greatest of the differences:

$$(M_1 - m_1),\ (M_2 - m_2),\ \ldots,\ (M_k - m_k),\ \ldots,\ (M_{n-1} - m_{n-1}),\ (M_n - m_n).$$

We now state the difference between the sums of the areas of the
interior and exterior rectangles:

$$S_n - s_n = (M_1 - m_1)(x_1 - x_0) + (M_2 - m_2)(x_2 - x_1) + \ldots + (M_k - m_k)(x_k - x_{k-1}) + \ldots + (M_n - m_n)(x_n - x_{n-1});$$

on replacing all the $(M_k - m_k)$ by the greatest difference $\varepsilon_n$ and
recalling that all the $(x_k - x_{k-1})$ are positive, we have:

$$S_n - s_n \le \varepsilon_n (x_1 - x_0) + \varepsilon_n (x_2 - x_1) + \ldots + \varepsilon_n (x_k - x_{k-1}) + \ldots + \varepsilon_n (x_n - x_{n-1}),$$

i.e.

$$S_n - s_n \le \varepsilon_n (x_n - x_0) = \varepsilon_n (b - a).$$

We can thus write:

$$\lim_{n \to \infty} (S_n - s_n) = 0. \tag{8}$$


On the other hand, we had for any $n$:

$$s_n \le S_{ab} \le S_n, \tag{9}$$

the magnitude of the area $S_{ab}$ being a definite number. It follows
directly from (8) and (9) that the magnitude of area $S_{ab}$ is the common
limit of $s_n$ and $S_n$, i.e. of the areas of the interior and exterior
rectangles:

$$\lim s_n = \lim S_n = S_{ab}.$$

But the sum $S_n'$ of the mean rectangles lies between $s_n$ and $S_n$, as
we have seen, so that this must also tend to the area $S_{ab}$, i.e.

$$\lim S_n' = S_{ab}.$$

The sum $S_n'$ is more general than $s_n$ or $S_n$, inasmuch as we can
arbitrarily choose the $\xi_k$ in the interval $(x_{k-1}, x_k)$, and in particular,
we can take $f(\xi_k)$ as equal to the least ordinate $m_k$ or the greatest
ordinate $M_k$.

With these choices, the sum $S_n'$ transforms to $s_n$ or $S_n$.
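
The three sums can be computed side by side for a simple case. The sketch below assumes, for simplicity only, that f is increasing on (a, b), so that the least and greatest ordinates in each strip are the ordinates at its left-hand and right-hand ends; it also assumes equal subdivisions and takes the mid-point of each strip as the point ξ_k. These choices are ours and are not required by the argument above.

```python
def riemann_sums(f, a, b, n):
    """Return (s_n, S'_n, S_n) for n equal strips.

    Assumes f is increasing on [a, b], so m_k = f(x_{k-1}) and M_k = f(x_k);
    the mean rectangles use the mid-point of each strip as xi_k.
    """
    dx = (b - a) / n
    xs = [a + k * dx for k in range(n + 1)]
    lower = sum(f(xs[k - 1]) * dx for k in range(1, n + 1))
    upper = sum(f(xs[k]) * dx for k in range(1, n + 1))
    mean = sum(f(0.5 * (xs[k - 1] + xs[k])) * dx for k in range(1, n + 1))
    return lower, mean, upper
```

For f(x) = x² on (0, 1) the difference between the exterior and interior sums is exactly [f(1) - f(0)]/n = 1/n, so that it tends to zero as n increases, while the mean sum remains between the two.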

The above discussion leads us to the following: 

Suppose the function $f(x)$ is continuous in an interval $(a, b)$; suppose
that, having divided the interval into $n$ parts by the points

$$a = x_0 < x_1 < x_2 < \ldots < x_{k-1} < x_k < \ldots < x_{n-1} < x_n = b,$$

we find the corresponding value of the function $f(\xi_k)$ for any $x = \xi_k$ in
the interval $(x_{k-1}, x_k)$, and that we now form the sum:

$$\sum_{k=1}^{n} f(\xi_k)(x_k - x_{k-1}); \tag{10}†$$

then this sum tends to a definite limit on indefinite increase of the number
$n$ of divisions of the interval and on indefinite decrease of the greatest of
the differences $(x_k - x_{k-1})$. This limit is equal to the area bounded by
the axis OX, the graph of function $f(x)$, and the two ordinates $x = a$
and $x = b$.


† $\sum_{k=1}^{n} f(\xi_k)(x_k - x_{k-1})$ is an abbreviated notation for the sum (6).

The limit in question is referred to as the definite integral of the
function $f(x)$ with respect to the variable $x$ between the lower limit
$x = a$ and the upper limit $x = b$; it is denoted by:

$$\int_a^b f(x)\,dx.$$

We note that the existence of a limit $I$ of the sum (10) in the case
of indefinite decrease of the greatest of $(x_k - x_{k-1})$ amounts to the
following assertion: for any given positive $\varepsilon$ there exists a positive
$\delta$ such that

$$\left| I - \sum_{k=1}^{n} f(\xi_k)(x_k - x_{k-1}) \right| < \varepsilon$$

for any chosen point $\xi_k$ in the interval $(x_{k-1}, x_k)$, provided only
that all the (positive) differences $x_k - x_{k-1} < \delta$. This limit $I$ is the
definite integral.

We have assumed above that the graph of $f(x)$ is wholly located
above axis OX, i.e. that all the ordinates of the graph are positive.
We now take the general case, where part of the graph is above OX,
and part below (Fig. 118).

If we form the sum (6) in this case, the terms $f(\xi_k)(x_k - x_{k-1})$
corresponding to parts of the graph below OX will be negative, since
the difference $(x_k - x_{k-1})$ is positive and the ordinate $f(\xi_k)$ is negative.

The definite integral obtained on passing to the limit will reckon
areas above OX with the (+) sign and areas below with the (-) sign,
i.e. in the general case, the definite integral

$$\int_a^b f(x)\,dx$$

will give the algebraic sum of the areas included between OX, the graph
of $f(x)$, and the ordinates $x = a$ and $x = b$. Areas above OX are here
given the positive sign, and areas below, the negative sign.
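
The sign convention is easily seen on an example of our own choosing, f(x) = sin x, whose graph lies above OX for 0 < x < π and below it for π < x < 2π. The sketch below approximates the integral by a sum of the form (6) with equal subdivisions and mid-points; the function name and the number of subdivisions are assumptions made for the illustration.

```python
import math


def signed_area(f, a, b, n=1000):
    """Sum of the form (6) with n equal strips and xi_k at the mid-points."""
    dx = (b - a) / n
    return sum(f(a + (k - 0.5) * dx) * dx for k in range(1, n + 1))


above = signed_area(math.sin, 0.0, math.pi)            # counted with the (+) sign
below = signed_area(math.sin, math.pi, 2.0 * math.pi)  # counted with the (-) sign
total = signed_area(math.sin, 0.0, 2.0 * math.pi)      # algebraic sum of the two
```

Here `above` is close to 2, `below` is close to -2, and `total` is close to zero, although the geometrical area enclosed between the curve and OX is about 4.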

As will be seen later, finding the limit of a sum of the form (6) is
not only involved in calculating areas, but is encountered in a wide
variety of scientific problems. We take just one example. Let a certain
point M be moving along axis OX from $x = a$ to $x = b$. Let it be acted
on by a certain force $T$, also directed along OX. If the force $T$ is
constant, the work done in moving the point from the position $x = a$
to $x = b$ is given by the product $R = T(b - a)$, i.e. the product of
the magnitude of the force and the path traversed by the point. If the
force $T$ is variable, the above formula is no longer valid. Suppose that
the magnitude of the force depends on the position of the point on OX,
i.e. we have $T = f(x)$.

To find the work done in this case, we subdivide the total path
traversed by means of the points

$$a = x_0 < x_1 < x_2 < \ldots < x_{k-1} < x_k < \ldots < x_{n-1} < x_n = b,$$

and we take one of the intervals $(x_{k-1}, x_k)$. We can take the force
acting on the point as it moves from $x_{k-1}$ to $x_k$ as constant, with an
error that is smaller, the shorter the length $(x_k - x_{k-1})$, and we
can set its value as $f(\xi_k)$ for some point $\xi_k$ of the interval $(x_{k-1}, x_k)$.
Hence we obtain an approximate expression for the work done in
the interval $(x_{k-1}, x_k)$:

$$R_k \approx f(\xi_k)(x_k - x_{k-1}).$$

The total work done will be given approximately by:

$$R \approx \sum_{k=1}^{n} f(\xi_k)(x_k - x_{k-1}).$$

On indefinite increase in the number $n$ of subdivisions and on indefinite
decrease of the greatest of the differences $(x_k - x_{k-1})$, we get
in the limit a definite integral, accurately expressing the work done:

$$R = \int_a^b f(x)\,dx.$$
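
The approximate expression for the work can be tried on simple forces. In the sketch below the force proportional to the displacement, T = 3x, is a hypothetical example of ours; the subdivisions are taken equal and ξ_k is taken at the mid-point of each interval.

```python
def work(force, a, b, n=1000):
    """Approximate work: sum of force(xi_k) * (x_k - x_{k-1}) over n equal parts."""
    dx = (b - a) / n
    return sum(force(a + (k - 0.5) * dx) * dx for k in range(1, n + 1))


constant_force = work(lambda x: 5.0, 1.0, 4.0)      # agrees with T*(b - a) = 15
variable_force = work(lambda x: 3.0 * x, 0.0, 2.0)  # hypothetical force T = 3x
```

For the constant force the sum reproduces the product T(b - a); for T = 3x it gives 6, which is the area of the triangle under the graph of the force between x = 0 and x = 2.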


Disregarding any possible geometrical or mechanical interpretations,
we can now fix the concept of the definite integral of a function $f(x)$
over the interval $a \le x \le b$ as the limit of a sum of the form (6). The
second basic task of the integral calculus is to study the properties
of the definite integral and, above all, to evaluate it. If $f(x)$ is a given
function, and $x = a$ and $x = b$ are given numbers, the definite integral

$$\int_a^b f(x)\,dx$$

is a determinate number. The $\int$ sign is a stylized letter S, recalling
the summation that gives, in the limit, the magnitude of the definite
integral. The expression under the integral, $f(x)\,dx$, recalls the form
of the term in the summation, viz., $f(\xi_k)(x_k - x_{k-1})$. The letter $x$,
standing under the sign of the definite integral, is usually referred to
as the variable of integration. We note an important detail as regards
this letter. The magnitude of the integral is a determinate number,
as already mentioned, and is of course not dependent on the notation $x$
for the variable of integration; any letter can be used to denote the
variable of integration in a definite integral. The choice has evidently
no influence at all on the magnitude of the integral, which depends
only on the ordinates of the graph of $f(x)$ and on the limits of integration
$a$ and $b$. Since the notation for the independent variable plays no part,
we have for instance:

$$\int_a^b f(x)\,dx = \int_a^b f(t)\,dt.$$


The second task of the integral calculus — that of evaluating the 
definite integral — consists in forming a sum of the form (6) and 
passing to the limit. This would seem a fairly complicated problem
at first sight. We note that the number of terms in this sum increases 
indefinitely on passing to the limit, whilst each term tends to zero. 
Apart from this, the second task of the integral calculus would appear 
to have no connection with the first task, that of finding the primitive 
of a given function f(x). 

We show in the following article that both tasks are intimately
related, and that evaluation of the definite integral $\int_a^b f(x)\,dx$ is
accomplished very simply, if the primitive of $f(x)$ is known.


88. The relation between the definite and indefinite integrals. We
again consider the area $S_{ab}$ bounded by the axis OX, the graph of
function $f(x)$, and ordinates $x = a$ and $x = b$.





Fig. 119


In addition to this area, we also consider a part of it, bounded by the
left-hand ordinate $x = a$ and by a movable ordinate, corresponding
to a variable value of $x$ (Fig. 119). The magnitude $S_{ax}$ of this latter
area will evidently depend on where we locate the right-hand ordinate,
i.e. it is a function of $x$, represented by the definite integral of $f(x)$,
taken from the lower limit $a$ to the upper limit $x$. Since the letter $x$ is
used to denote the upper limit, we shall avoid confusion by denoting
the variable of integration by a second letter, say $t$. We can
thus write:

$$S_{ax} = \int_a^x f(t)\,dt. \tag{11}$$


We have here a definite integral with a variable upper limit $x$,
and its magnitude is clearly a function of this limit. We show that
this function is one of the primitives of $f(x)$. We find the derivative
of the function by considering its increment $\Delta S_{ax}$ for the increment
$\Delta x$ of the independent variable $x$. We obviously have (Fig. 120):

$$\Delta S_{ax} = \text{area } M_1 M N N_1.$$

We denote the least and greatest ordinates of the graph of $f(x)$ in
the interval $(x, x + \Delta x)$ by $m$ and $M$ respectively. The curvilinear
figure $M_1 M N N_1$, drawn on a larger scale in Fig. 120, will lie wholly
inside the rectangle of height $M$ and base $\Delta x$, and will contain the
rectangle of height $m$ and the same base; hence

$$m\,\Delta x \le \Delta S_{ax} \le M\,\Delta x,$$

or, dividing by $\Delta x$:

$$m \le \frac{\Delta S_{ax}}{\Delta x} \le M.$$

Fig. 120

By the continuity of $f(x)$, both $m$ and $M$ tend to the common limit
$M_1 M = f(x)$, the ordinate of the curve at the point $x$, as $\Delta x$ tends
to zero; hence:

$$\lim_{\Delta x \to 0} \frac{\Delta S_{ax}}{\Delta x} = f(x),$$

which is what we wanted to prove. We can formulate the result obtained
as follows: a definite integral with a variable upper limit

$$\int_a^x f(t)\,dt$$

is a function of this upper limit, the derivative of which is equal to the
integrand $f(x)$ at the upper limit. In other words, the definite integral
with variable upper limit is a primitive of the integrand.
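
The inequality between m, the ratio of the increments, and M can be observed numerically. The sketch below takes f(x) = cos x and a = 0 as an example of our own; since cos x decreases near x = 1, the least and greatest ordinates in (x, x + Δx) are cos(x + Δx) and cos x. The area is approximated by a sum with mid-points, so the figures obtained are approximate.

```python
import math


def area_up_to(f, a, x, n=2000):
    """Approximate S_ax, the integral of f from a to x, by n mid-point rectangles."""
    dt = (x - a) / n
    return sum(f(a + (k - 0.5) * dt) * dt for k in range(1, n + 1))


def increment_ratio(f, a, x, dx):
    """Ratio of the increment of S_ax to the increment dx of the upper limit."""
    return (area_up_to(f, a, x + dx) - area_up_to(f, a, x)) / dx
```

With x = 1 the ratio `increment_ratio(math.cos, 0.0, 1.0, dx)` lies between cos(1 + dx) and cos 1, and approaches cos 1 as dx is diminished.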

Having established the connection between the concepts of definite
and indefinite integral, we now show how the definite integral

$$\int_a^b f(x)\,dx$$

can be evaluated, if the primitive $F(x)$ of $f(x)$ is known. The definite
integral with variable upper limit is in fact a primitive of $f(x)$, as we
have shown, and we can write, by [86]:

$$\int_a^x f(t)\,dt = F(x) + C, \tag{12}$$

where $C$ is a constant. We determine this constant by noting that
the area $S_{ax}$ evidently vanishes if its right-hand ordinate coincides
with the left-hand, i.e. if $x = a$; so that the left-hand side of (12)
vanishes for $x = a$. Putting $x = a$ in (12) thus gives us:

$$0 = F(a) + C, \quad\text{i.e.}\quad C = -F(a).$$

Substituting for $C$ in (12), we get:

$$\int_a^x f(t)\,dt = F(x) - F(a).$$

Finally, putting $x = b$ here, we find:

$$\int_a^b f(t)\,dt = F(b) - F(a) \quad\text{or}\quad \int_a^b f(x)\,dx = F(b) - F(a). \tag{13}$$

We thus arrive at the following fundamental rule, giving the magnitude
of a definite integral in terms of values of a primitive: the magnitude
of a definite integral is equal to the difference between the values of
the primitive of the integrand at the upper and lower limits of
integration.
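
The rule (13) may be compared with the definition of the integral as the limit of a sum. The following sketch uses f(x) = cos x with the primitive F(x) = sin x on the interval (0, π/2) as an illustrative example of ours; the sum is formed with equal subdivisions and mid-points, and the function names are assumptions.

```python
import math


def sum_of_rectangles(f, a, b, n):
    """Sum of the form (6) with n equal parts and xi_k at the mid-points."""
    dx = (b - a) / n
    return sum(f(a + (k - 0.5) * dx) * dx for k in range(1, n + 1))


def by_primitive(F, a, b):
    """Rule (13): the definite integral equals F(b) - F(a)."""
    return F(b) - F(a)


exact = by_primitive(math.sin, 0.0, math.pi / 2.0)
approximations = [sum_of_rectangles(math.cos, 0.0, math.pi / 2.0, n)
                  for n in (10, 100, 1000)]
```

The successive sums approach the value F(b) - F(a) = 1 as the number of subdivisions increases, whereas the rule (13) gives this value at once.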

The rule stated shows that finding a primitive, i.e. solving the
first problem of the integral calculus, also solves the second problem,
that of evaluating the definite integral; so that we do not need to
carry out the complicated operations of forming the sum (6) and
passing to the limit, in order to evaluate a definite integral.


We take as an example the definite integral

$$\int_0^1 x^2\,dx.$$

A primitive of the function $x^2$ is $x^3/3$ [86].
We have by the rule we deduced:†

$$\int_0^1 x^2\,dx = \frac{1}{3}x^3\,\Big|_0^1 = \frac{1}{3}\times 1^3 - \frac{1}{3}\times 0^3 = \frac{1}{3}.$$

If we were to calculate this definite integral directly from its definition 
as the limit of a sum, without using the primitive, we should find ourselves 
with a much more complicated calculation, which is briefly reproduced. 
We divide the interval (0,1) into n equal parts with the points: 


1 2 n—1 
0<—<—<...< 
n n 





* « 1. 


We now have the following n intervals: 


Uses o o 
n n n n n n 
the length of each being equal to 1/n. We form the sum (6) with ¢, taken 


as the left-hand end of the interval, i.e. 


§, =0, =>, TEA 








n— l 
EE TA $ 
n n 





We have x, — 2, 4 = l/n for all the differences, and we note that the 
integrand f(x) = x? has the values at the left-hand ends of the intervals: 


(n—1)? 
n 


1 22 
FE) — 0, f) = A fe) = ae ee K&) = . 


So we can write: 
1 


foras = lim 
0 


noo 


(n — 1)? 


1 1 1 22 1 
Wier tiga ae ar hog Pe 
(14) 


X 





E 1? 21. (n — 1} 


"n no nè 
We find the sum in the numerator by noting the series of obvious equal- 
ities: 
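$$\begin{aligned}
(1+1)^3 &= 1 + 3\times 1 + 3\times 1^2 + 1^3\\
(1+2)^3 &= 1 + 3\times 2 + 3\times 2^2 + 2^3\\
(1+3)^3 &= 1 + 3\times 3 + 3\times 3^2 + 3^3\\
&\;\;\vdots\\
[1+(n-1)]^3 &= 1 + 3(n-1) + 3(n-1)^2 + (n-1)^3.
\end{aligned}$$
Adding term by term, we get:
$$2^3 + 3^3 + \dots + n^3 = (n-1) + 3[1 + 2 + \dots + (n-1)] + 3[1^2 + 2^2 + \dots + (n-1)^2] + 1^3 + 2^3 + \dots + (n-1)^3.$$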


† The symbol $\varphi(x)\Big|_a^b$ denotes the difference $[\varphi(b) - \varphi(a)]$.


Cancelling, and using the formula for the sum of an arithmetic progression, 
we can write: 








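$$n^3 = (n-1) + 3\,\frac{n(n-1)}{2} + 3\left[1^2 + 2^2 + \dots + (n-1)^2\right] + 1,$$
whence
$$1^2 + 2^2 + \dots + (n-1)^2 = \frac{n^3 - n}{3} - \frac{n(n-1)}{2} = \frac{n(n-1)(2n-1)}{6}.$$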


Substituting the expression obtained in (14), we have: 


$$\int_0^1 x^2\,dx = \lim_{n\to\infty}\frac{n(n-1)(2n-1)}{6n^3} = \frac{1}{6}\lim_{n\to\infty}\left(1-\frac{1}{n}\right)\left(2-\frac{1}{n}\right) = \frac{2}{6} = \frac{1}{3}.$$
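The limit just obtained can be checked numerically. The following is a minimal sketch, not part of the original text: the function names are illustrative, and exact rational arithmetic is used so that the left-hand sum and the closed expression n(n - 1)(2n - 1)/6n³ can be compared without rounding error.

```python
from fractions import Fraction


def left_riemann_sum_x2(n: int) -> Fraction:
    """Left-endpoint sum for f(x) = x^2 on (0, 1) divided into n equal parts."""
    h = Fraction(1, n)  # common length of the sub-intervals
    return sum((Fraction(k, n) ** 2) * h for k in range(n))


def closed_form(n: int) -> Fraction:
    """The expression n(n-1)(2n-1) / (6 n^3) substituted into (14)."""
    return Fraction(n * (n - 1) * (2 * n - 1), 6 * n ** 3)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        s = left_riemann_sum_x2(n)
        # The two expressions agree exactly for every n.
        assert s == closed_form(n)
        print(n, float(s))
```

The printed values approach 1/3 from below as n increases, since the left-hand ends give the least value of x² in each sub-interval.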


We have now explained the basic problems of the integral calculus
and their inter-relationship. We devote the next paragraph to further
consideration of the first problem, that of finding and investigating
the properties of the indefinite integral.

Our above discussion of the definite integral has been based on purely
geometrical concepts, viz., on consideration of the areas $S_{ab}$ and $S_{ax}$.
In particular, the proof of the basic fact that the sum (6) has a limit
started from the assumption that there exists a definite area $S_{ab}$ for
every continuous curve. This assumption, for all its apparent obviousness,
has no sound basis, and the only mathematically rigorous course
would be in the opposite direction: to prove the existence of a limit S
of the sum
$$\sum_{k=1}^{n} f(\xi_k)(x_k - x_{k-1})$$
by direct analytic means, without regard to geometrical interpretation,
then use the limit for defining the area $S_{ab}$. We give this proof
at the end of the present chapter, and at the same time we make more
general assumptions regarding f(x) than those of continuity.

We also remark that a geometrical interpretation played an essen- 
tial part in proving the basic proposition that, given continuity of the 
integrand, the derivative of the definite integral with respect to 
its upper limit is equal to the value of the integrand at the upper 
limit. A rigorous analytic proof of this proposition is given in the 
next section. Combining this proof with the proof of the existence of 
a definite integral of a continuous function, we are able to assert that 
a primitive exists for every continuous function, i.e. an indefinite
integral exists. We describe the basic properties of the indefinite
integral later, on the assumption that we are only concerned with
continuous functions.

We give a rigorous proof of the basic formula (13) when we come
to describe the properties of definite integrals. Thus, the only
unproved fact that remains is the existence of a limit of the sum (10)
for a continuous function f(x). This is proved at the end of the
chapter, as already mentioned.


89. Properties of indefinite integrals. We saw in [86] that any two 
primitives of a given function can only differ by a constant term. 
This leads us to the first property of indefinite integrals: 

I. If two functions or two differentials are identical, their indefinite 
integrals can only differ by a constant term. 

Conversely, to show that two functions differ by a constant term, 
it is sufficient to show that their derivatives (or differentials) are iden- 
tical. 

The next properties, II and III, follow immediately from the concept
of indefinite integral as a primitive, i.e. from the fact that the
indefinite integral
$$\int f(x)\,dx$$
is a function such that its derivative with respect to x is equal to the
integrand f(x), or that its differential is equal to the integrand expression
$f(x)\,dx$.

II. The derivative of an indefinite integral is equal to the integrand,
whilst its differential is equal to the integrand expression:
$$\left(\int f(x)\,dx\right)'_x = f(x); \qquad d\int f(x)\,dx = f(x)\,dx. \tag{15}$$
III. We have, along with (15):
$$\int F'(x)\,dx = F(x) + C,$$
and this can be rewritten [50] as:
$$\int dF(x) = F(x) + C, \tag{16}$$
which, combined with property II, gives: the signs $d$ and $\int$ eliminate
each other when juxtaposed in any order, provided we agree to neglect
the arbitrary constant in the equation for an indefinite integral.
IV. A constant factor can be taken outside the integration sign:†
$$\int A f(x)\,dx = A\int f(x)\,dx + C. \tag{17}$$
V. The integral of an algebraic sum is equal to the algebraic sum of
the integrals of each term:
$$\int (u + v - w + \dots)\,dx = \int u\,dx + \int v\,dx - \int w\,dx + \dots + C. \tag{18}$$
Formulae (17) and (18) are easily seen to be correct by differentiating
both sides and observing the identity of the derivatives obtained.
For (17), for instance:
$$\left(\int A f(x)\,dx\right)' = A f(x);$$
$$\left(A\int f(x)\,dx + C\right)' = A\left(\int f(x)\,dx\right)' = A f(x).$$


90. Table of elementary integrals. This table is obtained by simply 
reading through the items of the table of elementary derivatives [49] 
in reverse order, when we get: 





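$$\int dx = x + C.$$
$$\int x^m\,dx = \frac{x^{m+1}}{m+1} + C, \quad\text{if } m \ne -1.$$
$$\int\frac{dx}{x} = \log x + C.$$
$$\int a^x\,dx = \frac{a^x}{\log a} + C. \qquad \int e^x\,dx = e^x + C.$$
$$\int\sin x\,dx = -\cos x + C. \qquad \int\cos x\,dx = \sin x + C.$$
$$\int\frac{dx}{\cos^2 x} = \tan x + C. \qquad \int\frac{dx}{\sin^2 x} = -\cot x + C.$$
$$\int\frac{dx}{1+x^2} = \operatorname{arc\,tan} x + C. \qquad \int\frac{dx}{\sqrt{1-x^2}} = \operatorname{arc\,sin} x + C.$$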


To check this table, it is sufficient to establish that the derivative
of the right-hand side of each equation is identical with the
integrand on the left. In general, knowledge of the function, of
which a given function f(x) is the derivative, gives at once the
indefinite integral of the latter. Usually, however, even in the simplest
examples, the given functions are not to be found in the table of
derivatives, which makes problems of the integral calculus a good
deal more difficult to work out than those of the differential calculus.
It is always a case of transforming the given integral to one contained
in the table of elementary integrals.

† The arbitrary constant is sometimes not written down after an indefinite
integral, it being assumed that the indefinite integral already contains such
a term. Equation (17) then runs:
$$\int A f(x)\,dx = A\int f(x)\,dx.$$

These transformations need experience and practice; they are facili- 
tated by use of the basic rules of the integral calculus to be found below. 


91. Integration by parts. If u, v are any two functions of x with 
continuous derivatives, we know that [50]: 


$$d(uv) = u\,dv + v\,du, \quad\text{or}\quad u\,dv = d(uv) - v\,du.$$
This gives us, using properties I, V and III:
$$\int u\,dv = \int[d(uv) - v\,du] + C = \int d(uv) - \int v\,du + C = uv - \int v\,du + C,$$
leading to the formula for integration by parts:
$$\int u\,dv = uv - \int v\,du + C. \tag{19}$$
We use this to pass from evaluating $\int u\,dv$ to evaluating $\int v\,du$, the
latter being possibly simpler.


Examples. 1. $\int \log x\,dx$.
Here we put
$$u = \log x, \quad dx = dv,$$
giving to start with:
$$du = \frac{dx}{x}, \quad v = x,$$
whence by (19):
$$\int \log x\,dx = x\log x - \int x\,\frac{dx}{x} + C = x\log x - x + C.$$
There is no need to write down the transformations separately in practice;
as much as possible of the working should be done mentally.

2. 
$$\int e^x x^2\,dx = \int x^2 e^x\,dx = \int x^2\,de^x = x^2 e^x - \int e^x\,dx^2 = x^2 e^x - 2\int e^x x\,dx,$$
$$\int e^x x\,dx = \int x\,de^x = x e^x - \int e^x\,dx = e^x x - e^x,$$
which finally gives
$$\int e^x x^2\,dx = e^x(x^2 - 2x + 2) + C.$$
3. 
$$\begin{aligned}
\int \sin x\cdot x^3\,dx &= \int x^3\sin x\,dx = \int x^3\,d(-\cos x) =\\
&= -x^3\cos x - \int(-\cos x)\,dx^3 = -x^3\cos x + 3\int x^2\cos x\,dx =\\
&= -x^3\cos x + 3\int x^2\,d\sin x = -x^3\cos x + 3x^2\sin x - 3\int\sin x\,dx^2 =\\
&= -x^3\cos x + 3x^2\sin x - 6\int x\sin x\,dx =\\
&= -x^3\cos x + 3x^2\sin x - 6\int x\,d(-\cos x) =\\
&= -x^3\cos x + 3x^2\sin x + 6x\cos x - 6\int\cos x\,dx =\\
&= -x^3\cos x + 3x^2\sin x + 6x\cos x - 6\sin x + C.
\end{aligned}$$


The method indicated by these examples is used generally for
evaluating integrals of the type:
$$\int \log x\cdot x^m\,dx,\quad \int e^{ax}x^m\,dx,\quad \int \sin bx\cdot x^m\,dx,\quad \int \cos bx\cdot x^m\,dx,$$
where m is any positive integer; care is only needed to see that the
power of x decreases with each successive transformation, until it
reaches zero.
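A result found by repeated integration by parts is conveniently checked by differentiation, as remarked in [90]. The following is a small sketch of such a check for Example 3, carried out numerically; the function names and the choice of a central difference quotient are assumptions made for illustration only.

```python
import math


def antiderivative(x: float) -> float:
    """Right-hand side of Example 3 with the arbitrary constant C = 0."""
    return (-x ** 3 * math.cos(x) + 3 * x ** 2 * math.sin(x)
            + 6 * x * math.cos(x) - 6 * math.sin(x))


def integrand(x: float) -> float:
    """The integrand x^3 sin x of Example 3."""
    return x ** 3 * math.sin(x)


def central_difference(f, x: float, h: float = 1e-5) -> float:
    """Symmetric difference quotient approximating the derivative f'(x)."""
    return (f(x + h) - f(x - h)) / (2 * h)


if __name__ == "__main__":
    for x in (0.5, 1.0, 2.0):
        # The two columns should agree to many decimal places.
        print(x, central_difference(antiderivative, x), integrand(x))
```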


92. Rule for change of variables. Examples. An integral $\int f(x)\,dx$ can
often be simplified by introducing a new variable t in place of x,
putting
$$x = \varphi(t). \tag{20}$$
An indefinite integral can be transformed simply by substituting the
new variable in the integrand expression:
$$\int f(x)\,dx = \int f[\varphi(t)]\,\varphi'(t)\,dt + C. \tag{21}$$
This is proved from property I of [89] by showing the identity
of the differentials of the left and right-hand sides of (21). We have
on differentiating:
$$d\left(\int f(x)\,dx\right) = f(x)\,dx = f[\varphi(t)]\,\varphi'(t)\,dt,$$
$$d\left(\int f[\varphi(t)]\,\varphi'(t)\,dt\right) = f[\varphi(t)]\,\varphi'(t)\,dt.$$
The inverse is often used instead of substitution (20):
$$t = \psi(x) \quad\text{and}\quad \psi'(x)\,dx = dt.$$


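Examples.

1. $\int (ax + b)^m\,dx$ (with $m \ne -1$).
The integral is simplified by substituting:
$$ax + b = t, \quad a\,dx = dt, \quad dx = \frac{dt}{a}.$$
Setting these values in the integral given, we find:
$$\int (ax+b)^m\,dx = \frac{1}{a}\int t^m\,dt = \frac{1}{a}\,\frac{t^{m+1}}{m+1} + C = \frac{1}{a}\,\frac{(ax+b)^{m+1}}{m+1} + C.$$
2. 
$$\int\frac{dx}{ax+b} = \frac{1}{a}\int\frac{dt}{t} = \frac{\log t}{a} + C = \frac{\log(ax+b)}{a} + C.$$
3. 
$$\int\frac{dx}{x^2+a^2} = \frac{1}{a}\int\frac{d(x/a)}{1+(x/a)^2} = \frac{1}{a}\operatorname{arc\,tan}\frac{x}{a} + C \quad \left(\text{substituting } t = \frac{x}{a}\right).$$
4. 
$$\int\frac{dx}{\sqrt{a^2-x^2}} = \int\frac{d(x/a)}{\sqrt{1-(x/a)^2}} = \operatorname{arc\,sin}\frac{x}{a} + C.$$
5. 
$$\int\frac{dx}{\sqrt{x^2+a}}.$$
We evaluate this integral by using Euler's substitution, about which more
will be said below. We introduce the new variable t given by the formula:
$$\sqrt{x^2+a} = t - x, \quad t = x + \sqrt{x^2+a}.$$
We square both sides to find x and dx:
$$x^2 + a = t^2 - 2tx + x^2, \quad x = \frac{t^2 - a}{2t}, \quad dx = \frac{1}{2}\left(1 + \frac{a}{t^2}\right)dt,$$
$$\sqrt{x^2+a} = t - \frac{t^2-a}{2t} = \frac{t^2+a}{2t}.$$
On carrying out all these substitutions in the integral given, we have:
$$\int\frac{dx}{\sqrt{x^2+a}} = \int\frac{2t}{t^2+a}\cdot\frac{1}{2}\,\frac{t^2+a}{t^2}\,dt = \int\frac{dt}{t} = \log t + C = \log\left(x + \sqrt{x^2+a}\right) + C.$$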








6. The integral
$$\int\frac{dx}{x^2 - a^2}$$
is evaluated by means of a special method, fully described later: that of
splitting the integrand into partial fractions.
Having factorized the denominator of the integrand:
$$x^2 - a^2 = (x-a)(x+a),$$
we write the integrand as a sum of simpler fractions:
$$\frac{1}{x^2-a^2} = \frac{A}{x-a} + \frac{B}{x+a}.$$
Constants A and B are found by clearing fractions:
$$1 = A(x+a) + B(x-a) = (A+B)x + a(A-B),$$
which must be true for any x. We thus find A and B from
$$a(A - B) = 1, \quad A + B = 0, \quad A = -B = \frac{1}{2a}.$$
We thus have:
$$\frac{1}{x^2-a^2} = \frac{1}{2a}\left[\frac{1}{x-a} - \frac{1}{x+a}\right],$$
$$\int\frac{dx}{x^2-a^2} = \frac{1}{2a}\left[\int\frac{dx}{x-a} - \int\frac{dx}{x+a}\right] = \frac{1}{2a}\left[\log(x-a) - \log(x+a)\right] + C = \frac{1}{2a}\log\frac{x-a}{x+a} + C.$$


7. Integrals of the more general type:
$$\int\frac{mx+n}{x^2+px+q}\,dx$$
can be reduced to the forms already given by completing the square in the
denominator of the integrand. We have:
$$x^2 + px + q = \left(x + \frac{p}{2}\right)^2 + q - \frac{p^2}{4}.$$
We now put:
$$x + \frac{p}{2} = t, \quad x = t - \frac{p}{2}, \quad dx = dt,$$
giving
$$mx + n = m\left(t - \frac{1}{2}p\right) + n = At + B,$$
where
$$A = m \quad\text{and}\quad B = n - \frac{1}{2}mp.$$
We finally put
$$q - \frac{p^2}{4} = \pm a^2,$$
where the (+) or (-) sign must be taken in accordance with the sign of the
left-hand side of this equation, a being taken positive; so that we can rewrite
the given integral as:
$$\int\frac{mx+n}{x^2+px+q}\,dx = \int\frac{At+B}{t^2\pm a^2}\,dt = A\int\frac{t\,dt}{t^2\pm a^2} + B\int\frac{dt}{t^2\pm a^2}.$$
The first of these integrals is evaluated at once by putting:
$$t^2 \pm a^2 = z, \quad 2t\,dt = dz,$$
which gives
$$\int\frac{t\,dt}{t^2\pm a^2} = \frac{1}{2}\int\frac{dz}{z} = \frac{1}{2}\log z = \frac{1}{2}\log(t^2 \pm a^2).$$
The second integral has the form calculated in Example 3 (for +) or 6
(for -).
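8. Integrals of the type
$$\int\frac{mx+n}{\sqrt{x^2+px+q}}\,dx$$
can be reduced to known forms by the same method of completing the square.
Using the notation of Example 7, we can rewrite the integral as
$$\int\frac{mx+n}{\sqrt{x^2+px+q}}\,dx = \int\frac{At+B}{\sqrt{t^2+b}}\,dt = A\int\frac{t\,dt}{\sqrt{t^2+b}} + B\int\frac{dt}{\sqrt{t^2+b}} \qquad \left(b = q - \frac{p^2}{4}\right).$$
The first of these integrals is evaluated by substituting
$$t^2 + b = z^2, \quad 2t\,dt = 2z\,dz,$$
giving
$$\int\frac{t\,dt}{\sqrt{t^2+b}} = \int dz = z = \sqrt{t^2+b}.$$
The second integral was worked out in Example 5 and is equal to $\log\left(t + \sqrt{t^2+b}\right)$.
9. A similar method of completing the square can be used to reduce
$$\int\frac{mx+n}{\sqrt{q+px-x^2}}\,dx$$
to the form:
$$A\int\frac{t\,dt}{\sqrt{a^2-t^2}} + B\int\frac{dt}{\sqrt{a^2-t^2}},$$
where we have
$$\int\frac{t\,dt}{\sqrt{a^2-t^2}} = -\sqrt{a^2-t^2} + C$$
by using the substitution $a^2 - t^2 = z^2$. The second integral is worked out
in Example 4.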


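10. 
$$\int\sin^2 x\,dx = \int\frac{1-\cos 2x}{2}\,dx = \frac{1}{2}\left(x - \frac{1}{2}\sin 2x\right) + C = \frac{1}{2}(x - \sin x\cos x) + C.$$
$$\int\cos^2 x\,dx = \int\frac{1+\cos 2x}{2}\,dx = \frac{1}{2}\left(x + \frac{1}{2}\sin 2x\right) + C = \frac{1}{2}(x + \sin x\cos x) + C.$$
11. The integral
$$\int\sqrt{x^2+a}\,dx$$
is reduced to a known form with the aid of integration by parts:
$$\int\sqrt{x^2+a}\,dx = x\sqrt{x^2+a} - \int x\,d\sqrt{x^2+a} = x\sqrt{x^2+a} - \int\frac{x^2}{\sqrt{x^2+a}}\,dx.$$
Adding and subtracting a in the numerator of the last integrand, we can
rewrite the above equation as
$$\int\sqrt{x^2+a}\,dx = x\sqrt{x^2+a} - \int\sqrt{x^2+a}\,dx + a\int\frac{dx}{\sqrt{x^2+a}},$$
or
$$2\int\sqrt{x^2+a}\,dx = x\sqrt{x^2+a} + a\int\frac{dx}{\sqrt{x^2+a}},$$
whence finally:
$$\int\sqrt{x^2+a}\,dx = \frac{1}{2}\left[x\sqrt{x^2+a} + a\log\left(x + \sqrt{x^2+a}\right)\right] + C.$$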
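The result of Example 11 may be tested against rule (13): the difference of the primitive at two limits should equal the definite integral computed directly as the limit of a sum. The sketch below assumes Simpson's rule as the numerical method and particular values of a and of the limits; neither appears in the text and both are chosen purely for illustration.

```python
import math


def simpson(f, lo: float, hi: float, n: int = 1000) -> float:
    """Composite Simpson rule with n (even) equal sub-intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        # Interior points carry weights 4, 2, 4, 2, ...
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3


def primitive(x: float, a: float) -> float:
    """Right-hand side of Example 11 with the arbitrary constant C = 0."""
    r = math.sqrt(x * x + a)
    return 0.5 * (x * r + a * math.log(x + r))


if __name__ == "__main__":
    a = 2.0  # illustrative value of the constant a
    direct = simpson(lambda x: math.sqrt(x * x + a), 0.0, 3.0)
    by_rule_13 = primitive(3.0, a) - primitive(0.0, a)
    print(direct, by_rule_13)
```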


93. Examples of differential equations of the first order. We considered 
some elementary differential equations in [51]. The general form of a first 
order differential equation is 


$$F(x, y, y') = 0.$$
This is a relationship connecting the independent variable x, the unknown
function y and its first derivative y'. The equation can usually be solved with
respect to y' and rewritten as:
$$y' = f(x, y),$$
where f(x, y) is a known function of x and y.

We only pause to consider a few elementary examples, the general case
of the equation being dealt with in Volume II.

Equation with separable variables. The function f(x, y) is given here as the
ratio of two functions, one of which only contains x, and the other only y:
$$y' = \frac{\varphi(x)}{\psi(y)}.$$
Since $y' = dy/dx$, we can rewrite this equation as:
$$\psi(y)\,dy = \varphi(x)\,dx, \tag{22}$$
so that we have only x on one side, and only y on the other side: this is
referred to as separating the variables. Since
$$\psi(y)\,dy = d\int\psi(y)\,dy, \quad \varphi(x)\,dx = d\int\varphi(x)\,dx,$$
we have by property I [89]:
$$\int\psi(y)\,dy = \int\varphi(x)\,dx + C, \tag{23}$$


whence the required function y can be found by integration. 

Examples. 1. Chemical reaction of the first order. Denoting the amount
of substance at the start of the reaction by a, and the amount taking part
in the reaction at the instant t by x, we have the equation [51]:
$$\frac{dx}{dt} = c(a - x), \tag{24}$$
where c is a constant of the reaction. We also have the condition:
$$x\big|_{t=0} = 0. \tag{25}$$
Separating the variables gives:
$$\frac{dx}{a - x} = c\,dt,$$
and integrating:
$$\int\frac{dx}{a-x} = \int c\,dt + C_1, \quad -\log(a - x) = ct + C_1,$$
where $C_1$ is an arbitrary constant. Hence,
$$a - x = e^{-ct - C_1} = Ce^{-ct},$$
where $C = e^{-C_1}$ is also an arbitrary constant. The constant can be found by
using condition (25), giving us a = C for t = 0 from the last equation; so
that finally:
$$x = a(1 - e^{-ct}).$$
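The solution of the first-order reaction equation can be compared with a direct step-by-step integration of dx/dt = c(a - x) from x = 0. The sketch below is an illustration only: the explicit Euler method, the function names and the numerical values of a, c and the final time are all assumptions, not taken from the text.

```python
import math


def exact(t: float, a: float, c: float) -> float:
    """Closed solution x = a (1 - e^{-ct}) satisfying x = 0 at t = 0."""
    return a * (1.0 - math.exp(-c * t))


def euler(a: float, c: float, t_end: float, steps: int) -> float:
    """Integrate dx/dt = c (a - x), x(0) = 0, by the explicit Euler method."""
    dt = t_end / steps
    x = 0.0
    for _ in range(steps):
        x += dt * c * (a - x)  # one small step along the tangent
    return x


if __name__ == "__main__":
    a, c, t_end = 2.0, 0.5, 3.0  # illustrative values
    print(euler(a, c, t_end, 100_000), exact(t_end, a, c))
```

The step-by-step value approaches the closed solution as the number of steps is increased, the error of this simple method being roughly proportional to the step length.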


2. Chemical reaction of the second order. Suppose amounts a and b of two
substances, measured in gram-molecules, are in solution at the start of the
reaction. Suppose that equal quantities x of the two substances take part
in the reaction at time t, so that the amounts remaining of the substances
are a - x and b - x.

By the basic law of chemical reactions of the second order, the reaction
takes place with a speed proportional to the product of the remaining amounts,
i.e.
$$\frac{dx}{dt} = k(a-x)(b-x).$$
This equation has to be integrated, with the initial condition
$$x\big|_{t=0} = 0.$$
Separating the variables gives
$$\frac{dx}{(a-x)(b-x)} = k\,dt,$$
and integrating:
$$\int\frac{dx}{(a-x)(b-x)} = kt + C_1,$$
where $C_1$ is an arbitrary constant.
We use the method of partial fractions to evaluate the integral on the left
(Example 6) [92]:
$$\frac{1}{(a-x)(b-x)} = \frac{A}{a-x} + \frac{B}{b-x},$$
$$1 = A(b-x) + B(a-x) = -(A+B)x + (Ab + Ba),$$
so that
$$-(A+B) = 0; \quad Ab + Ba = 1,$$
whence
$$A = -B = \frac{1}{b-a}.$$
Hence
$$\int\frac{dx}{(a-x)(b-x)} = \frac{1}{b-a}\left[\int\frac{dx}{a-x} - \int\frac{dx}{b-x}\right] = \frac{1}{b-a}\log\frac{b-x}{a-x},$$
$$\log\frac{b-x}{a-x} = (b-a)kt + (b-a)C_1,$$
$$\frac{b-x}{a-x} = Ce^{(b-a)kt},$$
where
$$C = e^{(b-a)C_1}.$$
The required function x is now found without difficulty.

We propose that the reader work out the special case of a = b, when the
above formula becomes meaningless.

[Fig. 121]

3. To find all the curves intercepting radius-vectors from the origin at a
constant angle† (Fig. 121).
Let M(x, y) be a point of a required curve. We have from the figure:
$$\omega = \alpha - \theta,$$
$$\tan\omega = \tan(\alpha-\theta) = \frac{\tan\alpha - \tan\theta}{1+\tan\alpha\tan\theta} = \frac{y' - \dfrac{y}{x}}{1 + \dfrac{y}{x}\,y'}.$$
Let $\tan\omega$ = constant = 1/a; on multiplying up, we get the differential
equation in the form:
$$x + yy' = a(y'x - y),$$
or, multiplying both sides by dx,
$$x\,dx + y\,dy = a(x\,dy - y\,dx). \tag{27}$$
This equation is easily integrated, if we transform from rectangular
coordinates x, y to polar coordinates r, θ, with OX as polar axis, and the origin
O as pole. We have [82]:
$$x^2 + y^2 = r^2, \quad \theta = \operatorname{arc\,tan}\frac{y}{x},$$
whence
$$x\,dx + y\,dy = r\,dr, \quad d\theta = \frac{1}{1+\dfrac{y^2}{x^2}}\,d\!\left(\frac{y}{x}\right) = \frac{x\,dy - y\,dx}{x^2+y^2} = \frac{x\,dy - y\,dx}{r^2}.$$
Equation (27) now becomes:
$$r\,dr = ar^2\,d\theta, \quad\text{i.e.}\quad \frac{dr}{r} = a\,d\theta.$$
We get on integration:
$$\log r = a\theta + C_1, \quad r = Ce^{a\theta}, \quad\text{where } C = e^{C_1}.$$
The curves obtained are called logarithmic spirals [83].

† In general, the angle between two curves is defined as the angle between
their tangents at the point of intersection.


§ 9. Properties of the definite integral


94. Basic properties of the definite integral. We have seen that the
definite integral
$$\int_a^b f(x)\,dx \tag{1}$$
is the limit of a sum of the form:
$$\sum_{k=1}^{n} f(\xi_k)(x_k - x_{k-1}) \qquad (x_{k-1} \le \xi_k \le x_k). \tag{2}$$
We have assumed here that a < b and correspondingly $x_{k-1} < x_k$.
If we have a > b, integral (1) can be defined as before as the limit
of the sum (2), except that now we have:
$$a = x_0 > x_1 > x_2 > \dots > x_{k-1} > x_k > \dots > x_{n-1} > x_n = b,$$
i.e. all the differences $x_k - x_{k-1}$ are negative. Finally, if we reverse
limits a and b, i.e. we take a as upper limit and b as lower limit,
the points $x_k$ of the interval must now be taken in the reverse order,
whilst all $(x_k - x_{k-1})$ in (2) change sign, so that the sum itself and
its limit change sign, i.e.
$$\int_b^a f(x)\,dx = -\int_a^b f(x)\,dx. \tag{3}$$
Further, on interpreting the definite integral as an area, it is natural
to take
$$\int_a^a f(x)\,dx = 0. \tag{4}$$
We also note the obvious equality:
$$\int_a^b dx = b - a. \tag{5}$$
The function under the integral is here equal to unity for all x, so that
$$\int_a^b dx = \lim\left[(x_1 - a) + (x_2 - x_1) + (x_3 - x_2) + \dots + (x_{n-1} - x_{n-2}) + (b - x_{n-1})\right];$$
but the expression in square brackets is equal to the constant (b - a).
Evidently [87], (5) gives the area of a rectangle of base (b - a) and
unit height.

We can now start by noting three properties of definite integrals: 

I. The value of a definite integral with identical upper and lower
limits is zero.

II. A definite integral preserves its absolute value and merely changes
sign, on interchanging the upper and lower limits:
$$\int_b^a f(x)\,dx = -\int_a^b f(x)\,dx.$$
For a < b, this property can be taken as defining the integral from
b to a. It is naturally assumed that the integral on the right exists.
III. The magnitude of a definite integral is independent of the notation
for the variable of integration:
$$\int_a^b f(x)\,dx = \int_a^b f(t)\,dt.$$
This has already been explained in [87].
The functions we consider will in future be assumed continuous in
the interval of integration, unless there is a proviso to the contrary.
IV. Given a series of numbers
$$a, b, c, \dots, k, l$$
arranged in any order, we have:
$$\int_a^l f(x)\,dx = \int_a^b f(x)\,dx + \int_b^c f(x)\,dx + \dots + \int_k^l f(x)\,dx. \tag{6}$$
It suffices to establish this formula for the case of three numbers
a, b, c, the proof being then easily extended to cover any required
number of terms.

We first take a < b < c. We have by definition:
$$\int_a^c f(x)\,dx = \lim\sum_{i=1}^{n} f(\xi_i)(x_i - x_{i-1}),$$
this limit being the same, irrespective of how we divide the interval
(a, c), provided only that the greatest of the differences $(x_i - x_{i-1})$
tends to zero, and their number increases indefinitely. We can decide
to divide (a, c) so that b, lying between a and c, appears as one of the
points of division. The sum
$$\sum_{i=1}^{n} f(\xi_i)(x_i - x_{i-1})$$
can now be split into two sums of the same form, one being found by
dividing the interval (a, b) and the other by dividing (b, c), with
the number of divisions increasing indefinitely in both cases, and with
the greatest of the $(x_i - x_{i-1})$ tending to zero. These two sums will
tend respectively to
$$\int_a^b f(x)\,dx \quad\text{and}\quad \int_b^c f(x)\,dx,$$
and we have finally:
$$\int_a^c f(x)\,dx = \lim\sum_{i=1}^{n} f(\xi_i)(x_i - x_{i-1}) = \int_a^b f(x)\,dx + \int_b^c f(x)\,dx,$$
which it was required to prove.
Now let b lie outside (a, c), say a < c < b. We can write in this
case:
$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx,$$
whence
$$\int_a^c f(x)\,dx = \int_a^b f(x)\,dx - \int_c^b f(x)\,dx.$$
But by property II:
$$-\int_c^b f(x)\,dx = \int_b^c f(x)\,dx,$$
i.e. we again have
$$\int_a^c f(x)\,dx = \int_a^b f(x)\,dx + \int_b^c f(x)\,dx.$$
All the other possible arrangements of the points can be considered
in a similar way.
V. A constant factor can be taken outside the definite integration
sign, i.e.
$$\int_a^b A f(x)\,dx = A\int_a^b f(x)\,dx,$$
since
$$\int_a^b A f(x)\,dx = \lim\sum_{i=1}^{n} A f(\xi_i)(x_i - x_{i-1}) = A\lim\sum_{i=1}^{n} f(\xi_i)(x_i - x_{i-1}) = A\int_a^b f(x)\,dx.$$
VI. The definite integral of an algebraic sum is equal to the algebraic
sum of the definite integrals of each term, since, e.g.
$$\int_a^b [f(x) - \varphi(x)]\,dx = \lim\sum_{i=1}^{n}[f(\xi_i) - \varphi(\xi_i)](x_i - x_{i-1}) =$$
$$= \lim\sum_{i=1}^{n} f(\xi_i)(x_i - x_{i-1}) - \lim\sum_{i=1}^{n}\varphi(\xi_i)(x_i - x_{i-1}) = \int_a^b f(x)\,dx - \int_a^b\varphi(x)\,dx.$$


95. Mean value theorem. VII. If functions f(x) and φ(x) satisfy the
condition
$$f(x) \le \varphi(x) \tag{7}$$
in the interval (a, b), then
$$\int_a^b f(x)\,dx \le \int_a^b \varphi(x)\,dx \qquad (b > a) \tag{8}$$
or briefly, an inequality can be integrated.
We form the difference
$$\int_a^b\varphi(x)\,dx - \int_a^b f(x)\,dx = \int_a^b[\varphi(x) - f(x)]\,dx = \lim\sum_{i=1}^{n}[\varphi(\xi_i) - f(\xi_i)](x_i - x_{i-1}).$$
By inequality (7), the terms under the summation are positive,
or at least, not negative. The same can therefore be said of the whole
sum and of its limit, i.e. of the difference between the integrals,
whence follows inequality (8).

We also explain the above in geometrical terms. We first suppose
that both curves
$$y = f(x), \quad y = \varphi(x)$$
lie above OX (Fig. 122). Then
the figure bounded by the curve y = f(x), OX and the ordinates
x = a and x = b, lies entirely within the similar figure bounded by
y = φ(x), and hence the area of the first figure cannot exceed
that of the second figure, i.e.
$$\int_a^b f(x)\,dx \le \int_a^b \varphi(x)\,dx.$$
The general case, with any arrangement of the given curves relative
to OX, whilst preserving condition (7), follows from the above by
giving the figure an upward displacement so that both curves appear
above OX; this displacement adds the same term c to both functions
f(x) and φ(x), and the same rectangular area with base (b - a) and
height c to the area of both figures, so that the inequality remains
valid.

COROLLARY. If we have in the interval (a, b):
$$|f(x)| \le \varphi(x) \le M, \tag{9}$$
then
$$\left|\int_a^b f(x)\,dx\right| \le \int_a^b \varphi(x)\,dx \le M(b - a) \qquad (b > a). \tag{10}$$
In fact, (9) is equivalent to:
$$-M \le -\varphi(x) \le f(x) \le \varphi(x) \le M.$$
Integrating these inequalities from a to b (property VII) and using
(5), we get:
$$-M(b - a) \le -\int_a^b\varphi(x)\,dx \le \int_a^b f(x)\,dx \le \int_a^b\varphi(x)\,dx \le M(b - a),$$
which is equivalent to inequality (10).
Setting φ(x) = |f(x)|, (10) gives the important inequality:
$$\left|\int_a^b f(x)\,dx\right| \le \int_a^b |f(x)|\,dx, \tag{10$_1$}$$
which is a generalization for the case of an integral of the well-known
property of a sum: the absolute magnitude of a sum is less than or
equal to the sum of the absolute magnitudes of the component terms.
It is easily seen that the sign of equality is obtained in the above
formula only in the case when f(x) does not change sign in the interval
(a, b).

An extremely important theorem also follows from property VII. 

Mean value theorem. If the function φ(x) preserves its sign in the
interval (a, b), then
$$\int_a^b f(x)\varphi(x)\,dx = f(\xi)\int_a^b\varphi(x)\,dx, \tag{11}$$
where ξ lies in the interval (a, b).

For clarity, we shall take φ(x) > 0 in the interval (a, b), whilst we
denote the least and greatest values of f(x) in (a, b) by m and M
respectively. Since clearly
$$m \le f(x) \le M$$
(the signs of equality being obtained simultaneously only when f(x)
is constant), and φ(x) > 0, then
$$m\varphi(x) \le f(x)\varphi(x) \le M\varphi(x),$$
and by property VII, taking b > a,
$$m\int_a^b\varphi(x)\,dx \le \int_a^b f(x)\varphi(x)\,dx \le M\int_a^b\varphi(x)\,dx.$$
Hence it is clear that there is a number P, satisfying m ≤ P ≤ M,
such that
$$\int_a^b f(x)\varphi(x)\,dx = P\int_a^b\varphi(x)\,dx. \tag{12}$$
Since f(x) is continuous, it takes in (a, b) all values included between
m and M, one of these being P [35]. Hence there exists ξ in (a, b)
such that
$$f(\xi) = P,$$
which proves formula (11).

If φ(x) < 0 in (a, b), then -φ(x) > 0 in (a, b). We get by applying
the theorem just proved:
$$\int_a^b f(x)[-\varphi(x)]\,dx = f(\xi)\int_a^b[-\varphi(x)]\,dx;$$
on taking the (-) sign outside the sign of integration and multiplying
both sides by (-1), we arrive at formula (11).
If b < a, we have from the above in exactly the same way:
$$\int_b^a f(x)\varphi(x)\,dx = f(\xi)\int_b^a\varphi(x)\,dx.$$
Interchanging the limits of integration on both sides and multiplying
by (-1), we arrive at (11), which is thus proved in general.

On putting φ(x) = 1, we obtain an important particular case of the
mean value theorem:
$$\int_a^b f(x)\,dx = f(\xi)\int_a^b dx = f(\xi)(b - a). \tag{13}$$
The value of a definite integral is equal to the product of the length of
the interval of integration and the value of the integrand for some value
of the independent variable lying in the interval.

This length must be taken with the (-) sign if a > b. This proposition
means geometrically that, given the area bounded by any curve,
OX and two ordinates x = a and x = b, it is always possible to
find a rectangle of the same area with the same base (b - a) and
with height equal to some ordinate of the curve in the interval (a, b)
(Fig. 123).
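The particular case (13) can be illustrated by actually locating a value ξ for a given integrand. The sketch below is illustrative only: it assumes an integrand that is monotonic in the interval (so that bisection applies), it uses Simpson's rule for the integral, and all names are hypothetical. For f(x) = x² in (0, 1) the integral is 1/3, so that ξ² = 1/3.

```python
def simpson(f, lo: float, hi: float, n: int = 1000) -> float:
    """Composite Simpson rule with n (even) equal sub-intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3


def mean_value_point(f, lo: float, hi: float, tol: float = 1e-12) -> float:
    """Find xi with f(xi) * (hi - lo) equal to the integral of f over (lo, hi).

    Assumes f is continuous and monotonic in the interval, so that the
    required ordinate is taken exactly once and bisection converges to it.
    """
    target = simpson(f, lo, hi) / (hi - lo)  # height of the equal-area rectangle
    left, right = lo, hi
    increasing = f(hi) >= f(lo)
    while right - left > tol:
        mid = 0.5 * (left + right)
        if (f(mid) < target) == increasing:
            left = mid
        else:
            right = mid
    return 0.5 * (left + right)


if __name__ == "__main__":
    xi = mean_value_point(lambda x: x * x, 0.0, 1.0)
    print(xi, xi * xi)  # xi * xi should be close to 1/3
```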
It is easily shown that the ξ appearing in (11) and (13) can always
be taken as lying inside (a, b).


96. Existence of the primitives. VIII. If the upper limit of a definite
integral is a variable, the derivative of the integral with respect to the
upper limit is equal to the value of the integrand at the upper limit.

We note that the value of
$$\int_a^b f(x)\,dx$$
depends on the limits of integration a and b, given the integrand f(x).
We consider
$$\int_a^x f(t)\,dt$$
with constant lower limit a and variable upper limit x, the variable
of integration being denoted by t to distinguish it from the upper limit
x. The value of this integral will be a function of x:
$$F(x) = \int_a^x f(t)\,dt. \tag{14}$$
We have to show that
$$\frac{dF(x)}{dx} = f(x).$$
We prove this by finding the derivative of F(x) directly from the
definition [45]:
$$\frac{dF(x)}{dx} = \lim_{h\to 0}\frac{F(x+h) - F(x)}{h}.$$
We have:
$$F(x+h) = \int_a^{x+h} f(t)\,dt = \int_a^x f(t)\,dt + \int_x^{x+h} f(t)\,dt$$
(by property IV), whence:
$$F(x+h) = F(x) + \int_x^{x+h} f(t)\,dt,$$
and
$$\frac{F(x+h) - F(x)}{h} = \frac{1}{h}\int_x^{x+h} f(t)\,dt.$$
Using (13), we have:
$$\int_x^{x+h} f(t)\,dt = f(\xi)\,h,$$
where ξ lies in the interval (x, x + h); hence
$$\frac{F(x+h) - F(x)}{h} = f(\xi).$$
As h tends to zero, ξ, lying between x and x + h, tends to x, and
by the continuity of f(x), f(ξ) tends to f(x), so that
$$\frac{dF(x)}{dx} = \lim_{h\to 0}\frac{F(x+h) - F(x)}{h} = \lim_{h\to 0} f(\xi) = f(x),$$
which it was required to prove.
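The course of this proof can be followed numerically. In the sketch below, which is an illustration and not part of the original argument, the quotient [F(x + h) - F(x)]/h is computed, as in the proof, from the integral of f over (x, x + h) alone; the midpoint rule, the function names and the choice f(t) = cos t are assumptions.

```python
import math


def integral(f, lo: float, hi: float, n: int = 1000) -> float:
    """Midpoint-rule approximation to the definite integral of f from lo to hi."""
    h = (hi - lo) / n
    return h * sum(f(lo + (i + 0.5) * h) for i in range(n))


def difference_quotient(f, x: float, h: float) -> float:
    """[F(x + h) - F(x)] / h, evaluated as (1/h) times the integral over (x, x + h)."""
    return integral(f, x, x + h) / h


if __name__ == "__main__":
    x = 1.0
    for h in (1e-1, 1e-2, 1e-3, 1e-4):
        # The quotient should approach f(x) = cos(1) as h tends to zero.
        print(h, difference_quotient(math.cos, x, h), math.cos(x))
```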

We note that h can only be given positive values for x = a, and
only negative values for x = b (with a < b), whilst F(x) has the
derivative f(x) throughout (a, b) (closed). We have already discussed in [46]
the definition of the derivative at the ends of a closed interval.

The corollary follows [45] that the definite integral F(x), considered
as a function of the upper limit x, is continuous in (a, b), where we must
take F(a) = 0.

We remark that if we apply the mean value theorem to integral (14),
we get F(x) = f(ξ)(x - a), whence it follows that F(x) → 0 as x → a.
It also follows from the above discussion that:

IX. Every continuous function f(x) has a primitive or indefinite
integral.

Function (14) is the primitive of f(x), which vanishes for x = a.

If $F_1(x)$ is one primitive, then as we saw in [88]:
$$\int_a^b f(x)\,dx = F_1(b) - F_1(a). \tag{15}$$


97. Discontinuities of the integrand. It has been assumed in all the
above discussions that the integrand f(x) is continuous throughout the
interval (a, b) of integration.

We now introduce the concept of integral for various discontinuous
functions.

If the integrand f(x) has a discontinuity at a point c of the interval
(a, b), whilst each of the integrals
$$\int_a^{c-\varepsilon'} f(x)\,dx, \quad \int_{c+\varepsilon''}^{b} f(x)\,dx \qquad (a < b)$$
tends to a definite limit as the positive numbers ε′ and ε″ tend to zero,
these limits are referred to as the definite integrals of f(x) between (a, c)
and (c, b) respectively, i.e.
$$\int_a^c f(x)\,dx = \lim_{\varepsilon'\to+0}\int_a^{c-\varepsilon'} f(x)\,dx,$$
$$\int_c^b f(x)\,dx = \lim_{\varepsilon''\to+0}\int_{c+\varepsilon''}^{b} f(x)\,dx,$$
if these limits exist.
We take in this case:
$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx.$$
The function F(x) defined by (14) is easily seen to have the following
properties:
F′(x) = f(x) for every point of (a, b) except x = c, and F(x) is
continuous throughout (a, b), including x = c.
If c coincides with one end of (a, b), only one of the limits has to be
considered, either
$$\lim_{\varepsilon\to+0}\int_a^{b-\varepsilon} f(x)\,dx \quad\text{or}\quad \lim_{\varepsilon\to+0}\int_{a+\varepsilon}^{b} f(x)\,dx.$$
Finally, if we have more than one point of discontinuity c in (a, b),
the interval must be divided so that there is only one point of
discontinuity in each interval.

Having agreed to attach the above meaning to the symbol
$$\int_a^b f(x)\,dx,$$
property IX and formula (15):
$$\int_a^b f(x)\,dx = F_1(b) - F_1(a)$$
will certainly be valid if $F_1'(x) = f(x)$ for every point of (a, b) except
x = c, and $F_1(x)$ is continuous throughout (a, b), including x = c.

It is sufficient to prove this assertion for one discontinuity c inside
(a, b), since the case of several discontinuities and the case of c = a
or c = b follow in a similar way.
Since f(x) is continuous in the intervals (a, c - ε′), (c + ε″, b),
we can apply (15) to these intervals, which gives:
$$\int_a^{c-\varepsilon'} f(x)\,dx = F_1(c - \varepsilon') - F_1(a),$$
$$\int_{c+\varepsilon''}^{b} f(x)\,dx = F_1(b) - F_1(c + \varepsilon'').$$
We can write, by the continuity of $F_1(x)$:
$$\int_a^c f(x)\,dx = \lim_{\varepsilon'\to+0}\left[F_1(c - \varepsilon') - F_1(a)\right] = F_1(c) - F_1(a),$$
$$\int_c^b f(x)\,dx = \lim_{\varepsilon''\to+0}\left[F_1(b) - F_1(c + \varepsilon'')\right] = F_1(b) - F_1(c),$$
i.e.
$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx = [F_1(c) - F_1(a)] + [F_1(b) - F_1(c)] = F_1(b) - F_1(a),$$
which it was required to prove.


The case in question is encountered geometrically when a curve y = f(x)
has a discontinuity at a point c yet the area under the curve always exists.
Take, for instance, the graph of a function defined as follows:
$$f(x) = \frac{1}{2}x + \frac{1}{2} \quad\text{for}\quad 0 \le x \le 2,$$
$$f(x) = x \quad\text{for}\quad 2 < x \le 3$$
(Fig. 124).

The area bounded by this curve, OX, the ordinate x = 0 and a variable
ordinate x, is a continuous function of x, in spite of the fact that f(x) has
a discontinuity at x = 2. On the other hand, it is easy to find a primitive of
f(x) that will be continuous throughout the interval (0, 3). Take, for
instance, the function $F_1(x)$ defined as follows:
$$F_1(x) = \frac{x^2}{4} + \frac{x}{2} \quad\text{for}\quad 0 \le x \le 2,$$
$$F_1(x) = \frac{x^2}{2} \quad\text{for}\quad 2 < x \le 3.$$
We find, in fact, on differentiating:
$$F_1'(x) = \frac{1}{2}x + \frac{1}{2}$$
in (0, 2), and $F_1'(x) = x$ in (2, 3). Furthermore, the two expressions for $F_1(x)$
give the same value 2 for x = 2, which ensures the continuity of $F_1(x)$.

The area bounded by our curve, OX and the ordinates x = 0 and
x = 3, is given by:
$$\int_0^3 f(x)\,dx = \int_0^2 f(x)\,dx + \int_2^3 f(x)\,dx = F_1(3) - F_1(0) = \frac{9}{2},$$
which is easily checked directly from the figure.
We take further the function $y = x^{-2/3}$ (Fig. 125). It tends to infinity for
x = 0, but its primitives remain continuous for this value of x, one
primitive being $3x^{1/3}$; hence we can write:
$$\int_{-1}^{+1} x^{-2/3}\,dx = 3x^{1/3}\Big|_{-1}^{+1} = 6;$$
in other words, although the curve in question rises infinitely as x approaches
zero, it still has a perfectly definite area between the ordinates x = -1 and
x = +1.

The primitive (-1/x) of the function $1/x^2$ itself tends to infinity for
x = 0, so that (15) cannot be applied for this function in the case of zero
lying inside the interval (a, b); the curve of $1/x^2$ does not possess a finite area
for such an interval.
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98. Infinite limits. The preceding discussion can be extended to the 
case of an infinite interval, on taking 


Y 19 de= lim fI) de, (16) 
fo de = lim | f) dz. (17) 


provided these limits exist. 

This proviso is certainly fulfilled if the primitive $F_1(x)$ tends to a
definite limit as $x$ tends to $(+\infty)$ or $(-\infty)$. Denoting these
limits directly by $F_1(+\infty)$ and $F_1(-\infty)$, we have:

$$\int_a^{+\infty} f(x)\,dx = \lim_{b \to +\infty} [F_1(b) - F_1(a)] = F_1(+\infty) - F_1(a), \tag{18}$$

$$\int_{-\infty}^{b} f(x)\,dx = \lim_{a \to -\infty} [F_1(b) - F_1(a)] = F_1(b) - F_1(-\infty), \tag{19}$$

$$\int_{-\infty}^{+\infty} f(x)\,dx = \int_{-\infty}^{a} f(x)\,dx + \int_a^{+\infty} f(x)\,dx = F_1(+\infty) - F_1(-\infty), \tag{20}$$

this last being a generalization of (15) for the case of an infinite
interval.
Relationship (16) is often written as

$$\int_a^{\infty} f(x)\,dx = \lim_{b \to +\infty} \int_a^b f(x)\,dx.$$

Geometrically speaking, fulfilment of the above proviso means that
the infinite branch of the curve $y = f(x)$, corresponding to $x \to +\infty$,
has an area.

We have thus extended the concept of definite integral, originally 
established for a continuous function and a finite interval, to the 
cases of a discontinuous function and an infinite interval. This exten- 
sion is characterized by first finding the integral of a continuous func- 
tion for a shortened interval, then passing to the limit. Integrals 
obtained in this manner are distinguished from primitives by being 
referred to as improper integrals. 

We note that the integral of a discontinuous function in a finite
interval has in some cases a direct significance as the limit of a sum
(cf. [94]). We shall discuss this later [116]. This is the case, e.g., with
the integral expressing the area shown in Fig. 124. In essence, therefore,
this integral will not be improper. If, however, the integrand
is unbounded in the interval of integration (tends to infinity), or if
this interval is infinite, the integral can then only exist in the
improper form.
Example. The curve $y = 1/(1 + x^2)$ extends indefinitely for $x = \pm\infty$
yet always bounds a finite area with axis OX (Fig. 126), since

$$\int_{-\infty}^{+\infty} \frac{dx}{1 + x^2} = \arctan x\,\Big|_{-\infty}^{+\infty} = \frac{\pi}{2} - \left(-\frac{\pi}{2}\right) = \pi.$$

It should be recalled, when evaluating this integral, that an arbitrary
value of the many-valued function arc tan $x$ cannot be taken; the function
must be defined as in [24] so that it becomes single-valued, i.e. its values
lie between $(-\pi/2)$ and $(+\pi/2)$; if this is not done, the above formula
becomes meaningless.
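The passage to the limit that defines an improper integral can be followed numerically. The sketch below is only an illustration: the function names are our own, and it relies on the primitives quoted above (arc tan $x$ with values between $-\pi/2$ and $+\pi/2$, which is what `math.atan` returns, and $3x^{1/3}$).

```python
import math

def arctan_area(b):
    """Integral of 1/(1 + x^2) over (-b, b), from the primitive arctan x."""
    return math.atan(b) - math.atan(-b)

def cube_root_area(eps):
    """Integral of x^(-2/3) over (-1, -eps) and (eps, 1), from the primitive 3 x^(1/3).

    By symmetry each of the two pieces equals 3 * (1 - eps^(1/3)).
    """
    return 2 * 3 * (1 - eps ** (1.0 / 3.0))

for b in (10.0, 1e3, 1e6):
    print(b, arctan_area(b))          # tends to pi as b grows
for eps in (1e-3, 1e-6, 1e-12):
    print(eps, cube_root_area(eps))   # tends to 6 as eps shrinks
```

The second loop converges slowly because the correction term is of order $\varepsilon^{1/3}$.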


99. Change of variable for definite integrals. Let $f(x)$ be continuous
in the interval $(a, b)$, or in the wider interval $(A, B)$, discussed below.
Further, let $\varphi(t)$ be a single-valued, continuous function with a
continuous derivative $\varphi'(t)$ in the interval $(\alpha, \beta)$, where

$$\varphi(\alpha) = a \quad \text{and} \quad \varphi(\beta) = b. \tag{21}$$

We further suppose that $\varphi(t)$ does not move outside $(a, b)$, or the
wider interval $(A, B)$, in which $f(x)$ is continuous, when $t$ varies in
$(\alpha, \beta)$. The function of a function $f[\varphi(t)]$ is now a
continuous function of $t$ in the interval $(\alpha, \beta)$.

If, with these assumptions, we introduce a new variable of integration
$t$ in place of $x$:

$$x = \varphi(t), \tag{22}$$

the formula for transforming the definite integral is:

$$\int_a^b f(x)\,dx = \int_\alpha^\beta f[\varphi(t)]\,\varphi'(t)\,dt. \tag{23}$$

We prove this by introducing integrals with variable limits in place
of those above:

$$F(x) = \int_a^x f(y)\,dy, \qquad \Psi(t) = \int_\alpha^t f[\varphi(z)]\,\varphi'(z)\,dz.$$

By (22), $F(x)$ is a function of a function of $t$:

$$F(x) = F[\varphi(t)] = \int_a^{\varphi(t)} f(y)\,dy.$$

We find its derivative by the rule for differentiation of a function of
a function:

$$\frac{dF(x)}{dt} = \frac{dF(x)}{dx} \cdot \frac{dx}{dt},$$

whilst by property VIII [96]:

$$\frac{dF(x)}{dx} = f(x);$$

it also follows from (22) that

$$\frac{dx}{dt} = \varphi'(t),$$

whence

$$\frac{dF(x)}{dt} = f(x)\,\varphi'(t) = f[\varphi(t)]\,\varphi'(t).$$

We now find the derivative of $\Psi(t)$. We have by property VIII and
the assumptions made:

$$\frac{d\Psi(t)}{dt} = f[\varphi(t)]\,\varphi'(t).$$

Functions $\Psi(t)$ and $F(x)$, considered as functions of $t$, thus have the
same derivative in the interval $(\alpha, \beta)$, and hence [89] can only
differ by a constant; whilst we have for $t = \alpha$:

$$x = \varphi(\alpha) = a, \qquad F(x)\big|_{t=\alpha} = F(a) = 0; \qquad \Psi(\alpha) = 0,$$

i.e. these two functions are equal for $t = \alpha$ and hence are equal for
all $t$ in $(\alpha, \beta)$. In particular, we have for $t = \beta$:

$$F(x)\big|_{t=\beta} = F(b) = \int_a^b f(x)\,dx = \int_\alpha^\beta f[\varphi(t)]\,\varphi'(t)\,dt,$$

which it was required to prove.
The inverse:

$$t = \psi(x) \tag{24}$$

is very often used instead of substitution (22):

$$x = \varphi(t).$$

Limits $\alpha$ and $\beta$ are then immediately defined by:

$$\alpha = \psi(a), \qquad \beta = \psi(b),$$

whilst it must be borne in mind that expression (22) for $x$, obtained by
solving (24) with respect to $x$, must satisfy all the conditions mentioned
above; in particular, $\varphi(t)$ must be a single-valued function of $t$.
The following case shows that (23) can fail if $\varphi(t)$ lacks this
property.

We replace $x$ in $\int_{-1}^{+1} dx = 2$ by the new variable $t$, where
$t = x^2$, and obtain an integral equal to zero from the right-hand side of
(23), since its limits are the same, $+1$; but this is impossible; the error
arises due to the expression for $x$ in terms of $t$:

$$x = \pm\sqrt{t}$$

being a many-valued function.
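Formula (23) and the failure just described can both be checked numerically. The following is a minimal sketch under our own assumptions: the helper `midpoint` (a plain midpoint-rule sum), the test integrand $\sqrt{1 - x^2}$ and the substitution $x = \sin t$ are illustrative choices, not taken from the text.

```python
import math

def midpoint(g, lo, hi, n=20000):
    """Midpoint-rule approximation of the integral of g from lo to hi."""
    h = (hi - lo) / n
    return h * sum(g(lo + (k + 0.5) * h) for k in range(n))

def f(x):
    return math.sqrt(1.0 - x * x)

# Valid substitution x = phi(t) = sin t, with alpha = 0 and beta = pi/2.
lhs = midpoint(f, 0.0, 1.0)
rhs = midpoint(lambda t: f(math.sin(t)) * math.cos(t), 0.0, math.pi / 2)

# With t = x^2 on (-1, 1), x = +sqrt(t) or -sqrt(t) is two-valued.
# Treating each single-valued branch separately restores the value 2.
def dx_dt(t):
    return 1.0 / (2.0 * math.sqrt(t))      # derivative of x = +sqrt(t)

positive_branch = midpoint(dx_dt, 0.0, 1.0)                 # x from 0 to 1
negative_branch = midpoint(lambda t: -dx_dt(t), 1.0, 0.0)   # x from -1 to 0
print(lhs, rhs, positive_branch + negative_branch)
```

Both `lhs` and `rhs` approximate $\pi/4$. The branch sum approaches 2 only slowly, since the integrand $1/(2\sqrt{t})$ is unbounded near $t = 0$ and the integral is improper in the sense of [98].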
Example. We call $f(x)$ an even function of $x$ if $f(-x) = f(x)$, and an odd
function if $f(-x) = -f(x)$.
For instance, $\cos x$ is an even function of $x$, and $\sin x$ an odd
function. We show that

$$\int_{-a}^{+a} f(x)\,dx = 2\int_0^a f(x)\,dx,$$

if $f(x)$ is even, and that

$$\int_{-a}^{+a} f(x)\,dx = 0,$$

if $f(x)$ is odd.
We separate the integral into two parts [94, IV]:

$$\int_{-a}^{+a} f(x)\,dx = \int_{-a}^{0} f(x)\,dx + \int_0^a f(x)\,dx.$$

We make a change of variable, $x = -t$, in the first integral and use
properties II and III [94]:

$$\int_{-a}^{0} f(x)\,dx = -\int_a^0 f(-t)\,dt = \int_0^a f(-t)\,dt = \int_0^a f(-x)\,dx,$$

whence, substituting in the previous equation:

$$\int_{-a}^{+a} f(x)\,dx = \int_0^a f(-x)\,dx + \int_0^a f(x)\,dx = \int_0^a [f(-x) + f(x)]\,dx.$$

If $f(x)$ is an even function, $[f(-x) + f(x)] = 2f(x)$, whilst the sum in the
square brackets is zero if $f(x)$ is odd; and this proves our statement.

100. Integration by parts. In the case of definite integrals, the formula
for integration by parts [91] can be put in the form:

$$\int_a^b u(x)\,dv(x) = u(x)\,v(x)\,\Big|_a^b - \int_a^b v(x)\,du(x). \tag{25}$$

We integrate term by term the identity [91]:

$$u(x)\,dv(x) = d[u(x)\,v(x)] - v(x)\,du(x),$$

and obtain, in fact:

$$\int_a^b u(x)\,dv(x) = \int_a^b d[u(x)\,v(x)] - \int_a^b v(x)\,du(x),$$

whilst by property IX [96]:

$$\int_a^b d[u(x)\,v(x)] = \int_a^b \frac{d[u(x)\,v(x)]}{dx}\,dx = u(x)\,v(x)\,\Big|_a^b,$$

and hence we get (25). Of course it is assumed that $u(x)$ and $v(x)$ have
continuous derivatives in the interval $(a, b)$.


Example. To evaluate:

$$\int_0^{\pi/2} \sin^n x\,dx, \qquad \int_0^{\pi/2} \cos^n x\,dx.$$

We put

$$I_n = \int_0^{\pi/2} \sin^n x\,dx.$$

Integrating by parts, we have:

$$I_n = \int_0^{\pi/2} \sin^{n-1} x \,\sin x\,dx = -\int_0^{\pi/2} \sin^{n-1} x\,d\cos x =$$

$$= -\sin^{n-1} x \cos x\,\Big|_0^{\pi/2} + \int_0^{\pi/2} (n-1)\sin^{n-2} x \cos x \cdot \cos x\,dx =$$

$$= (n-1)\int_0^{\pi/2} \sin^{n-2} x \cos^2 x\,dx = (n-1)\int_0^{\pi/2} \sin^{n-2} x\,(1 - \sin^2 x)\,dx =$$

$$= (n-1)\int_0^{\pi/2} \sin^{n-2} x\,dx - (n-1)\int_0^{\pi/2} \sin^{n} x\,dx = (n-1)\,I_{n-2} - (n-1)\,I_n,$$

i.e.

$$I_n = (n-1)\,I_{n-2} - (n-1)\,I_n,$$

whence, solving for $I_n$:

$$I_n = \frac{n-1}{n}\,I_{n-2}. \tag{26}$$

This is called a reduction formula, since it reduces the evaluation of $I_n$
to the evaluation of a similar integral, but with a lower subscript $(n - 2)$.

We take the cases separately, of $n$ even or odd.

1. $n = 2k$ (even). We have by (26):

$$I_{2k} = \frac{2k-1}{2k}\,I_{2k-2} = \frac{(2k-1)(2k-3)}{2k(2k-2)}\,I_{2k-4} = \dots = \frac{(2k-1)(2k-3)\cdots 3 \cdot 1}{2k(2k-2)\cdots 4 \cdot 2}\,I_0,$$

and since

$$I_0 = \int_0^{\pi/2} dx = \frac{\pi}{2},$$

we have finally:

$$I_{2k} = \frac{(2k-1)(2k-3)\cdots 3 \cdot 1}{2k(2k-2)\cdots 4 \cdot 2} \cdot \frac{\pi}{2}.$$

2. $n = 2k + 1$ (odd). We find, as above:

$$I_{2k+1} = \frac{2k(2k-2)\cdots 4 \cdot 2}{(2k+1)(2k-1)\cdots 5 \cdot 3}\,I_1, \qquad I_1 = \int_0^{\pi/2} \sin x\,dx = -\cos x\,\Big|_0^{\pi/2} = 1,$$

and hence

$$I_{2k+1} = \frac{2k(2k-2)\cdots 4 \cdot 2}{(2k+1)(2k-1)\cdots 5 \cdot 3}.$$

The integral

$$\int_0^{\pi/2} \cos^n x\,dx$$

can be evaluated in a similar way, though it is easier to reduce it to the
former by noting that

$$\int_0^{\pi/2} \cos^n x\,dx = \int_0^{\pi/2} \sin^n\left(\frac{\pi}{2} - x\right) dx,$$

whence, putting

$$\frac{\pi}{2} - x = t, \qquad x = \frac{\pi}{2} - t,$$

we have on the basis of (23) and property II [94]:

$$\int_0^{\pi/2} \cos^n x\,dx = -\int_{\pi/2}^{0} \sin^n t\,dt = \int_0^{\pi/2} \sin^n t\,dt.$$

Taking together the results obtained, we can write:

$$\int_0^{\pi/2} \sin^{2k} x\,dx = \int_0^{\pi/2} \cos^{2k} x\,dx = \frac{(2k-1)(2k-3)\cdots 3 \cdot 1}{2k(2k-2)\cdots 4 \cdot 2} \cdot \frac{\pi}{2}, \tag{27}$$

$$\int_0^{\pi/2} \sin^{2k+1} x\,dx = \int_0^{\pi/2} \cos^{2k+1} x\,dx = \frac{2k(2k-2)\cdots 4 \cdot 2}{(2k+1)(2k-1)\cdots 5 \cdot 3}. \tag{28}$$
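The reduction formula (26) translates directly into a short loop. The sketch below is illustrative only: the function names are ours, and the midpoint sum is used merely as an independent check of the values given by (27) and (28).

```python
import math

def sin_power_integral(n):
    """I_n = integral of sin^n x over (0, pi/2), built up from I_0 or I_1 by (26)."""
    value = math.pi / 2 if n % 2 == 0 else 1.0   # I_0 = pi/2, I_1 = 1
    for m in range(2 + n % 2, n + 1, 2):
        value *= (m - 1) / m                      # I_m = (m - 1)/m * I_{m-2}
    return value

def sin_power_midpoint(n, steps=2000):
    """Midpoint-rule approximation of the same integral, for comparison."""
    h = (math.pi / 2) / steps
    return h * sum(math.sin((k + 0.5) * h) ** n for k in range(steps))

for n in range(7):
    print(n, sin_power_integral(n), sin_power_midpoint(n))
```

For instance, $n = 4$ gives $\frac{3 \cdot 1}{4 \cdot 2} \cdot \frac{\pi}{2} = \frac{3\pi}{16}$, in agreement with (27) for $k = 2$.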


§ 10. Applications of definite integrals 


101. Calculation of area. We saw in [87] that the area bounded by
a given curve $y = f(x)$, by the axis OX and two ordinates $x = a$ and
$x = b$ is expressed by the definite integral:

$$\int_a^b f(x)\,dx.$$

The area thus found, however, does not give us the actual sum of
the areas formed by the given curve with OX, but gives only their
algebraic sum, in which every area situated below OX takes the (−)
sign. To find the sum of these areas in the ordinary sense, we have
to find

$$\int_a^b |f(x)|\,dx.$$

[Fig. 127]

Thus, the sum of the shaded areas in Fig. 127 is equal to

$$\int_a^c f(x)\,dx - \int_c^g f(x)\,dx + \int_g^h f(x)\,dx - \int_h^k f(x)\,dx + \int_k^b f(x)\,dx.$$

The area contained between two curves

$$y = f(x), \qquad y = \varphi(x) \tag{1}$$

and two ordinates

$$x = a, \qquad x = b,$$

where one curve lies below the other, i.e.

$$f(x) \ge \varphi(x)$$

in the interval $(a, b)$, is given by the definite integral

$$\int_a^b [f(x) - \varphi(x)]\,dx. \tag{2}$$

We suppose first that both curves lie above OX. It is at once evident
from Fig. 128 that the required area $S$ is the difference between
the areas bounded by the given curves with OX:

$$S = \int_a^b f(x)\,dx - \int_a^b \varphi(x)\,dx = \int_a^b [f(x) - \varphi(x)]\,dx,$$

[Fig. 128]

which is what we required to prove. The general case, of any
disposition of the two curves relative to OX, now follows by displacing
OX downwards, so that both curves appear above OX; this displacement
is equivalent to adding the same constant to both functions,
with the difference $f(x) - \varphi(x)$ remaining unchanged.

We propose as an exercise the proof that, if two given curves intersect,
i.e. part of one curve is below the other, and part above, the sum
of the areas contained between the curves and the ordinates $x = a$ and
$x = b$ is equal to

$$\int_a^b |f(x) - \varphi(x)|\,dx. \tag{3}$$

Evaluation of a definite integral is often called quadrature. This is
because finding a definite integral often amounts to finding an area,
as shown above.


Examples. 1. The area bounded by the second degree parabola

$$y = ax^2 + bx + c,$$

the axis OX, and two ordinates, distant $h$ apart, is equal to

$$\frac{h}{6}\,(y_1 + y_2 + 4y_0), \tag{4}$$

where $y_1$ and $y_2$ denote the outer ordinates of the curve, and $y_0$ is
the ordinate lying equidistant from each of these outer ordinates.
We assume here that the curve lies above OX.

We can take without loss of generality the left-hand outer ordinate as
along OY (Fig. 129) in proving (4), since displacement of the entire figure
along OX changes neither the magnitude of the area in question, nor the
relative disposition of the outer and centre ordinates, nor the size of these
ordinates. With this assumption, and with the equation of the parabola in
the form:

$$y = ax^2 + bx + c,$$

[Fig. 129] [Fig. 130]

we write the required area $S$ as the definite integral:

$$S = \int_0^h (ax^2 + bx + c)\,dx = \left[a\frac{x^3}{3} + b\frac{x^2}{2} + cx\right]_0^h = a\frac{h^3}{3} + b\frac{h^2}{2} + ch = \frac{h}{6}\,(2ah^2 + 3bh + 6c).$$

We have from our notation:

$$y_1 = ax^2 + bx + c\,\big|_{x=0} = c, \qquad y_0 = ax^2 + bx + c\,\big|_{x=h/2} = \frac{ah^2}{4} + \frac{bh}{2} + c,$$

$$y_2 = ax^2 + bx + c\,\big|_{x=h} = ah^2 + bh + c,$$

whence it follows:

$$y_1 + y_2 + 4y_0 = 2ah^2 + 3bh + 6c,$$

which proves our example.
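Formula (4) can be compared with the primitive for arbitrary coefficients. This is a small sketch with names of our own choosing; the left-hand ordinate is placed at an arbitrary abscissa `x0` rather than on OY, to illustrate that a displacement along OX changes nothing.

```python
def parabola_area(a, b, c, x0, h):
    """Area under y = a x^2 + b x + c between x0 and x0 + h, from the primitive."""
    def primitive(x):
        return a * x**3 / 3 + b * x**2 / 2 + c * x
    return primitive(x0 + h) - primitive(x0)

def three_ordinate_area(a, b, c, x0, h):
    """The same area from formula (4): h/6 * (y1 + y2 + 4*y0)."""
    def y(x):
        return a * x * x + b * x + c
    y1, y0, y2 = y(x0), y(x0 + h / 2), y(x0 + h)
    return h / 6 * (y1 + y2 + 4 * y0)

print(parabola_area(1.0, -2.0, 5.0, 0.3, 1.7))
print(three_ordinate_area(1.0, -2.0, 5.0, 0.3, 1.7))
```

The two printed values agree up to rounding.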
2. Area of an ellipse. An ellipse having the equation

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$$

is symmetrical relative to the axes, and hence the required area $S$ is equal
to four times the area of the part lying in the first quadrant, i.e.

$$S = 4\int_0^a y\,dx$$

(Fig. 130). Instead of finding $y$ from the equation of the ellipse and
substituting the expression obtained in the integrand, we use the parametric
form of the ellipse:

$$x = a\cos t, \qquad y = b\sin t, \tag{5}$$

introducing the new variable $t$ in place of $x$; $y$ is now expressed directly
by the second of equations (5). When $x$ varies from 0 to $a$, $t$ varies from
$\pi/2$ to 0, and since all the conditions for the rule of change of variables
[99] are fulfilled in this case,

$$S = 4\int_{\pi/2}^{0} b\sin t\,d(a\cos t) = -4ab\int_{\pi/2}^{0} \sin^2 t\,dt = 4ab\int_0^{\pi/2} \sin^2 t\,dt.$$

We have for $k = 1$ from (27) [100]:

$$\int_0^{\pi/2} \sin^2 t\,dt = \frac{1}{2} \cdot \frac{\pi}{2} = \frac{\pi}{4},$$

so that we finally get:

$$S = \pi ab. \tag{6}$$

With $a = b$, when the ellipse becomes a circle of radius $a$, we get the
well-known expression $\pi a^2$ for the area of a circle.

[Fig. 131]
3. To find the area contained between the two curves

$$y = x^2, \qquad x = y^2.$$

The given curves (Fig. 131) intersect at two points (0,0), (1,1), their
coordinates being obtained by simultaneously solving the equations of the
curves. Since we have in (0,1):

$$\sqrt{x} \ge x^2,$$

we find on using (2) that the required area $S$ is given by

$$S = \int_0^1 (\sqrt{x} - x^2)\,dx = \left(\frac{2}{3}x^{3/2} - \frac{x^3}{3}\right)\Big|_0^1 = \frac{1}{3}.$$

102. Area of a sector. The area of the sector bounded by a curve with
polar equation

$$r = f(\theta), \tag{7}$$

and the two radius vectors

$$\theta = \alpha, \qquad \theta = \beta \tag{8}$$

drawn from the pole at angles $\alpha$ and $\beta$ to the polar axis, is given
by:

$$S = \int_\alpha^\beta \frac{1}{2}\,r^2\,d\theta = \frac{1}{2}\int_\alpha^\beta [f(\theta)]^2\,d\theta. \tag{9}$$

We deduce (9) by dividing the area in question into small elements
(Fig. 132), the angle between the radius vectors (8) being divided
into $n$ parts. We take the area of one such small sector, bounded by radius
vectors $\theta$ and $\theta + \Delta\theta$. Let $\Delta S$ denote this area,
and $m$ and $M$ the least and greatest values respectively of $r = f(\theta)$
in the interval $(\theta, \theta + \Delta\theta)$; then we see that $\Delta S$
lies between the areas of two circular sectors with the same angle
$\Delta\theta$ but with radii $m$ and $M$, i.e.

$$\frac{1}{2}\,m^2\,\Delta\theta \le \Delta S \le \frac{1}{2}\,M^2\,\Delta\theta,$$

[Fig. 132]

and hence, denoting some number between $m$ and $M$ by $P$, we can
write:

$$\Delta S = \frac{1}{2}\,P^2\,\Delta\theta.$$

Since the continuous function $f(\theta)$ takes all values between $m$ and
$M$ in the interval $(\theta, \theta + \Delta\theta)$, there must exist
$\theta'$ in this interval such that

$$f(\theta') = P,$$

so that now

$$\Delta S = \frac{1}{2}\,[f(\theta')]^2\,\Delta\theta. \tag{10}$$

If we now increase the number of elementary sectors $\Delta S$, so that
the greatest of the $\Delta\theta$ tends to zero, and if we recall what was
said in [87], we get in the limit:

$$S = \lim \sum \Delta S = \lim \sum \frac{1}{2}\,[f(\theta')]^2\,\Delta\theta = \int_\alpha^\beta \frac{1}{2}\,[f(\theta)]^2\,d\theta = \int_\alpha^\beta \frac{1}{2}\,r^2\,d\theta,$$

which it was required to prove.

We remark that the basic idea in the above proof of (9) lies in
replacing the area $\Delta S$ by the area of a circular sector of the same
angle $\Delta\theta$ and radius $f(\theta')$. If instead of the exact
expression (10) we use the approximation:

$$\Delta S \approx \frac{1}{2}\,r^2\,\Delta\theta,$$

where $r = f(\theta'')$ and $\theta''$ is any value from the interval
$(\theta, \theta + \Delta\theta)$, we get in the limit the same result for the
area of a sector:

$$\lim \sum \frac{1}{2}\,[f(\theta'')]^2\,\Delta\theta = \int_\alpha^\beta \frac{1}{2}\,r^2\,d\theta. \tag{11}$$

This leads us to a simple geometrical interpretation of the integral
expression in (11): $\frac{1}{2}\,r^2\,d\theta$ is an approximate expression
for the area of an elementary sector of angle $d\theta$, and hence is simply
referred to as an elementary area in polar coordinates.

[Fig. 133]
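The sum of elementary sectors in (11) is easy to form explicitly. The sketch below is an illustration under assumptions of our own: the function name `sector_sum`, the choice of $\theta''$ at the middle of each interval, and the test curve $r = 2a\cos\theta$ (a circle of radius $a$ through the pole, whose area $\pi a^2$ is known) are not taken from the text.

```python
import math

def sector_sum(f, alpha, beta, n):
    """Sum of circular-sector areas (1/2) f(theta'')^2 * dtheta over n equal angles."""
    d = (beta - alpha) / n
    return sum(0.5 * f(alpha + (k + 0.5) * d) ** 2 * d for k in range(n))

a = 1.5

def circle(theta):
    return 2 * a * math.cos(theta)   # circle of radius a passing through the pole

approx = sector_sum(circle, -math.pi / 2, math.pi / 2, 1000)
print(approx, math.pi * a * a)
```

Any other choice of $\theta''$ inside each interval would give the same limit, as remarked above.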


Example. To find the area bounded by the closed curve

$$r = a\cos 3\theta \qquad (a > 0).$$

This curve, called a trifolium, is shown in Fig. 133 and is plotted without
difficulty. The total area bounded by it is six times the area of the shaded
part, corresponding to a variation of $\theta$ from 0 to $\pi/6$; so that we
have from (9):

$$S = 6\int_0^{\pi/6} \frac{1}{2}\,a^2\cos^2 3\theta\,d\theta = a^2\int_0^{\pi/6} \cos^2 3\theta\,d(3\theta) = a^2\int_0^{\pi/2} \cos^2 t\,dt = \frac{\pi a^2}{4}.$$

103. Length of arc. Let AB be the arc of a certain curve. We draw
a series of successive chords (Fig. 134) and increase their number in
such a way that the greatest tends to zero. If the total length of the
series of chords now tends to a definite limit, independently of how
the chords are drawn, the arc is said to be rectifiable, and the limit in
question is called the length of this arc. This definition of length is also
suitable for a closed curve.

[Fig. 134]

Let the curve be given explicitly as $y = f(x)$, and let A and B correspond
to $x = a$ and $x = b$ respectively $(a < b)$; let $f(x)$ also have
a continuous derivative in the interval $a \le x \le b$, to which the arc
AB corresponds. We show that the arc AB can be rectified in these conditions,
and that its length is expressed by a definite integral.

Let the chords be $AM_1, M_1M_2, \dots, M_{n-1}B$, and let the coordinates
of their ends be

$$a = x_0 < x_1 < x_2 < \dots < x_{n-1} < x_n = b,$$

and $y_i = f(x_i)$. Using the formula of analytic geometry, we have for
the total length of the chords:

$$p = \sum_{i=1}^{n} \sqrt{(x_i - x_{i-1})^2 + (y_i - y_{i-1})^2} = \sum_{i=1}^{n} \sqrt{(x_i - x_{i-1})^2 + [f(x_i) - f(x_{i-1})]^2}.$$

Using the formula for finite increments:

$$f(x_i) - f(x_{i-1}) = f'(\xi_i)\,(x_i - x_{i-1}) \qquad (x_{i-1} < \xi_i < x_i),$$

we get for the length of a single chord

$$\sqrt{1 + [f'(\xi_i)]^2}\;(x_i - x_{i-1}),$$

from which we see that the requirement that the greatest chord tends
to zero is equivalent to the requirement that the greatest of the
$(x_i - x_{i-1})$ tends to zero. We now have

$$p = \sum_{i=1}^{n} \sqrt{1 + [f'(\xi_i)]^2}\;(x_i - x_{i-1}),$$

which in fact has a limit equal to the integral

$$\int_a^b \sqrt{1 + [f'(x)]^2}\,dx.$$

Thus, the length $l$ of arc AB is given by:

$$l = \int_a^b \sqrt{1 + [f'(x)]^2}\,dx. \tag{12}$$

Let $x' < x''$ be any two values of $x$ in the interval $(a, b)$, and let
$M'$, $M''$ be the corresponding points of arc AB. Using the mean value
theorem, we get the following expression for the length $l'$ of arc
$M'M''$:

$$l' = \int_{x'}^{x''} \sqrt{1 + [f'(x)]^2}\,dx = \sqrt{1 + [f'(\xi_1)]^2}\;(x'' - x') \qquad (x' < \xi_1 < x'').$$

Using the formula for finite differences, we get for the length of
chord $M'M''$:

$$\overline{M'M''} = \sqrt{(x'' - x')^2 + [f(x'') - f(x')]^2} = \sqrt{1 + [f'(\xi_2)]^2}\;(x'' - x') \qquad (x' < \xi_2 < x'').$$

Hence it follows:

$$\frac{\overline{M'M''}}{l'} = \frac{\sqrt{1 + [f'(\xi_2)]^2}}{\sqrt{1 + [f'(\xi_1)]^2}}.$$

If $M'$ and $M''$ tend to M with abscissa $x$, $x'$ and $x''$ tend to $x$, and
also $\xi_1$ and $\xi_2 \to x$, and we get from the above formula:

$$\frac{\overline{M'M''}}{l'} \to 1.$$

We made use of this in [70].
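The definition of length as a limit of inscribed chords can be tried out directly. In the sketch below the function name and the test curve $y = \cosh x$ are our own choices; for that curve $\sqrt{1 + (y')^2} = \cosh x$, so that (12) gives the length $\sinh 1$ between $x = 0$ and $x = 1$.

```python
import math

def chord_length(f, a, b, n):
    """Total length of n chords inscribed in y = f(x) on (a, b), equal steps in x."""
    xs = [a + (b - a) * i / n for i in range(n + 1)]
    return sum(math.hypot(xs[i] - xs[i - 1], f(xs[i]) - f(xs[i - 1]))
               for i in range(1, n + 1))

exact = math.sinh(1.0)   # value of integral (12) for y = cosh x on (0, 1)
for n in (1, 10, 100, 1000):
    print(n, chord_length(math.cosh, 0.0, 1.0, n), exact)
```

The chord sums increase with each refinement and approach the integral from below, since a chord is never longer than the arc it subtends.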
We now suppose that the curve is given parametrically,

$$x = \varphi(t), \qquad y = \psi(t),$$

where A and B correspond to $t = \alpha$ and $t = \beta$ $(\alpha < \beta)$.
We assume that points of the curve AB correspond to $t$ in the interval
$\alpha \le t \le \beta$, with different points for different $t$, and with the
curve neither closed nor intersecting itself (Fig. 134). We assume further
that there exist continuous derivatives $\varphi'(t)$ and $\psi'(t)$ in the
interval $\alpha \le t \le \beta$.

Let chords be drawn as above, $AM_1, M_1M_2, \dots, M_{n-1}B$, with
their ends corresponding to
$t_0 = \alpha < t_1 < t_2 < \dots < t_{n-1} < t_n = \beta$.
We have for the total length of the chords:

$$p = \sum_{i=1}^{n} \sqrt{[\varphi(t_i) - \varphi(t_{i-1})]^2 + [\psi(t_i) - \psi(t_{i-1})]^2}$$

or, using the formula for finite increments,

$$p = \sum_{i=1}^{n} \sqrt{[\varphi'(\tau_i)]^2 + [\psi'(\tau_i')]^2}\;(t_i - t_{i-1}) \qquad (t_{i-1} < \tau_i < t_i \text{ and } t_{i-1} < \tau_i' < t_i). \tag{13}$$

It can be shown that the requirement that the greatest of the chords
tends to zero is equivalent to the requirement that the greatest of
$(t_i - t_{i-1})$ tends to zero. This can be proved without assuming the
existence of derivatives $\varphi'(t)$ and $\psi'(t)$.

Expression (13) differs from the sum giving in the limit the integral

$$\int_\alpha^\beta \sqrt{[\varphi'(t)]^2 + [\psi'(t)]^2}\,dt, \tag{14}$$

since the arguments $\tau_i$ and $\tau_i'$ are in general different. We
introduce the sum

$$q = \sum_{i=1}^{n} \sqrt{[\varphi'(\tau_i)]^2 + [\psi'(\tau_i)]^2}\;(t_i - t_{i-1}),$$

which gives integral (14) in the limit. To show that (13) tends to the
limit (14), we have to show that the difference

$$p - q = \sum_{i=1}^{n} \left[\sqrt{[\varphi'(\tau_i)]^2 + [\psi'(\tau_i')]^2} - \sqrt{[\varphi'(\tau_i)]^2 + [\psi'(\tau_i)]^2}\right](t_i - t_{i-1})$$

tends to zero.
We multiply and divide by the sum of the radicals, and get

$$p - q = \sum_{i=1}^{n} \frac{\psi'(\tau_i') + \psi'(\tau_i)}{\sqrt{[\varphi'(\tau_i)]^2 + [\psi'(\tau_i')]^2} + \sqrt{[\varphi'(\tau_i)]^2 + [\psi'(\tau_i)]^2}}\;[\psi'(\tau_i') - \psi'(\tau_i)]\,(t_i - t_{i-1}).$$

Since

$$|\psi'(\tau_i') + \psi'(\tau_i)| \le \sqrt{[\varphi'(\tau_i)]^2 + [\psi'(\tau_i')]^2} + \sqrt{[\varphi'(\tau_i)]^2 + [\psi'(\tau_i)]^2},$$

we have

$$|p - q| \le \sum_{i=1}^{n} |\psi'(\tau_i') - \psi'(\tau_i)|\,(t_i - t_{i-1}).$$

The numbers $\tau_i$ and $\tau_i'$ belong to the interval $(t_{i-1}, t_i)$, and
we can say, by the uniform continuity of $\psi'(t)$ in the interval
$\alpha \le t \le \beta$, that the greatest of the
$|\psi'(\tau_i') - \psi'(\tau_i)|$, which we denote by $\delta$, will tend to
zero if the greatest of the $(t_i - t_{i-1})$ tends to zero. But we have from
the above formula:

$$|p - q| \le \sum_{i=1}^{n} \delta\,(t_i - t_{i-1}) = \delta \sum_{i=1}^{n} (t_i - t_{i-1}) = \delta\,(\beta - \alpha),$$

whence it is clear that $p - q \to 0$. Thus the total length of the chords,
given by (13), tends to integral (14), i.e.

$$l = \int_\alpha^\beta \sqrt{[\varphi'(t)]^2 + [\psi'(t)]^2}\,dt. \tag{15}$$

This formula for the length $l$ remains valid in the case of a closed curve.
This can be shown, for instance, simply by dividing the closed curve
into two open curves, applying (15) to each, and adding the values of
$l$ obtained. Similarly, if a curve $L$ consists of a finite number of curves
$L_k$, each with a parametric form that satisfies the conditions given
above, the length of each $L_k$ can be found from (15), and the total
length of $L$ obtained by summation.

We take $t$ variable in the interval $(\alpha, \beta)$, with the corresponding
point M varying over the arc AB. The length of arc AM will be a
function of $t$, given by

$$s(t) = \int_\alpha^t \sqrt{[\varphi'(t)]^2 + [\psi'(t)]^2}\,dt. \tag{16}$$

We get by the rule for differentiation of an integral with respect to
its upper limit,

$$\frac{ds}{dt} = \sqrt{[\varphi'(t)]^2 + [\psi'(t)]^2}, \tag{17}$$

whence, noting that

$$\varphi'(t) = \frac{dx}{dt}, \qquad \psi'(t) = \frac{dy}{dt},$$

we get the formula for the differential of an arc [70]:

$$ds = \sqrt{(dx)^2 + (dy)^2},$$

whilst (15) can be written without indicating the variable of integration,
in the form:

$$l = \int_{(A)}^{(B)} ds = \int_{(A)}^{(B)} \sqrt{(dx)^2 + (dy)^2}.$$

Limits (A) and (B) indicate the initial and final points of the arc.
If $[\varphi'(t)]^2 + [\psi'(t)]^2 > 0$ for all $t$ in $(\alpha, \beta)$, (17)
gives us the derivative of the parameter $t$ with respect to $s$:

$$\frac{dt}{ds} = \frac{1}{\sqrt{[\varphi'(t)]^2 + [\psi'(t)]^2}}.$$

The existence of continuous derivatives $\varphi'(t)$ and $\psi'(t)$, together
with the condition $[\varphi'(t)]^2 + [\psi'(t)]^2 > 0$, ensures a tangent
varying continuously along AB.

If the curve is given in polar coordinates by the equation

$$r = f(\theta),$$

we can introduce rectangular coordinates $x$ and $y$, related to $r$ and
$\theta$ by

$$x = r\cos\theta, \qquad y = r\sin\theta \tag{18}$$

[82], then take these equations as a parametric form of the curve
with the parameter $\theta$.
We then have:

$$dx = \cos\theta\,dr - r\sin\theta\,d\theta; \qquad dy = \sin\theta\,dr + r\cos\theta\,d\theta;$$

whence

$$(dx)^2 + (dy)^2 = (dr)^2 + r^2(d\theta)^2,$$

$$ds = \sqrt{(dx)^2 + (dy)^2} = \sqrt{(dr)^2 + r^2(d\theta)^2}, \tag{19}$$

and if A, B correspond to values $\alpha$, $\beta$ respectively of the polar
angle $\theta$ (Fig. 135), (15) gives us:

$$l = \int_\alpha^\beta \sqrt{r^2 + (r')^2}\,d\theta \qquad \left(r' = \frac{dr}{d\theta}\right). \tag{20}$$

[Fig. 135]

Expression (19) for $ds$, called the differential of an arc in polar
coordinates, can be got directly from the figure, by replacing the infinitely
small arc $MM'$ by its chord, then taking the chord as the hypotenuse
of the right-angled triangle $MNM'$, the adjacent sides of which, $MN$
and $NM'$, are respectively approximately equal to $r\,d\theta$ and $dr$.








Examples. 1. The length of arc $s$ of the parabola $y = x^2$, measured from
the vertex (0,0) to a variable point with abscissa $x$, is given by (12) as the
integral:

$$s = \int_0^x \sqrt{1 + (y')^2}\,dx = \int_0^x \sqrt{1 + 4x^2}\,dx = \frac{1}{2}\int_0^{2x} \sqrt{1 + t^2}\,dt \tag{21}$$

(substituting $t = 2x$).
We have by Example 11 [92]:

$$\int \sqrt{1 + t^2}\,dt = \frac{1}{2}\left[t\sqrt{1 + t^2} + \log\left(t + \sqrt{1 + t^2}\right)\right] + C.$$

Substituting this in (21), we easily obtain:

$$s = \frac{1}{4}\left[2x\sqrt{1 + 4x^2} + \log\left(2x + \sqrt{1 + 4x^2}\right)\right].$$

2. The length of the ellipse

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$$

is equal, by its symmetry with respect to the axes, to four times the length
of the part lying in the first quadrant. We use the parametric equations of
the ellipse

$$x = a\cos t, \qquad y = b\sin t,$$

and note that variation from A to B means variation of the parameter from
0 to $\pi/2$; we then get, by (15), the following expression for the required
length $l$:

$$l = 4\int_0^{\pi/2} \sqrt{a^2\sin^2 t + b^2\cos^2 t}\,dt. \tag{22}$$

This integral cannot be evaluated in an explicit form; a method of
approximate evaluation, given later, has to be used.
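As a rough illustration of what an approximate evaluation of (22) looks like, the sketch below forms a midpoint sum for the integral. This is not the method referred to in the text; the function name and the number of subdivisions are our own assumptions.

```python
import math

def ellipse_length(a, b, n=2000):
    """Midpoint-sum approximation of 4 * integral of sqrt(a^2 sin^2 t + b^2 cos^2 t) dt over (0, pi/2)."""
    h = (math.pi / 2) / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * h
        total += math.sqrt((a * math.sin(t)) ** 2 + (b * math.cos(t)) ** 2) * h
    return 4 * total

print(ellipse_length(1.0, 1.0), 2 * math.pi)   # circle of radius 1
print(ellipse_length(2.0, 1.0))                 # ellipse with semi-axes 2 and 1
```

For $a = b$ the integrand is constant and the sum reproduces the circumference $2\pi a$. In general the integrand lies between the two semi-axes, so the result lies between the circumferences of the inscribed and circumscribed circles.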
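3. The length of arc of the logarithmic spiral

$$r = Ce^{a\theta}$$

[83], cut off by the radius-vectors $\theta = \alpha$, $\theta = \beta$, is given
by (20) as the integral:

$$\int_\alpha^\beta \sqrt{r^2 + \left(\frac{dr}{d\theta}\right)^2}\,d\theta = C\sqrt{1 + a^2}\int_\alpha^\beta e^{a\theta}\,d\theta = \frac{C\sqrt{1 + a^2}}{a}\left[e^{a\beta} - e^{a\alpha}\right].$$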

4. Let $M(x, y)$ be a point of the catenary considered in [78]. We find the
length of arc AM (Fig. 93). Using the expression for $1 + (y')^2$ in [78], we
get:

$$\text{arc } AM = \int_0^x \sqrt{1 + (y')^2}\,dx = \int_0^x \frac{y}{a}\,dx = \frac{1}{2}\int_0^x \left(e^{x/a} + e^{-x/a}\right)dx = \frac{a}{2}\left(e^{x/a} - e^{-x/a}\right) = ay',$$

whence

$$a^2 + (\text{arc } AM)^2 = a^2 + a^2(y')^2 = a^2\left[1 + (y')^2\right] = y^2,$$

i.e. the length of arc AM is equal to the adjacent side of a right-angled
triangle with hypotenuse equal to the ordinate of M, and the other adjacent
side equal to $a$. This gives us the following rule for finding the length of
arc AM:

With the vertex A of the catenary as centre, describe a circle with radius
equal to the ordinate of M; let the circle cut OX in Q; then OQ is the rectified
arc AM (Fig. 93).

We have been guided in the choice of sign in the above formula by the
fact that $y'$ is positive for points lying on the right-hand side of the
catenary.

5. We take the cycloid of [79] and find the length of arc $l$ of the branch
$OO'$ (Fig. 94), and the area $S$ bounded between this branch and OX:

$$l = \int_0^{2\pi} \sqrt{[\varphi'(t)]^2 + [\psi'(t)]^2}\,dt = \int_0^{2\pi} \sqrt{a^2(1 - \cos t)^2 + a^2\sin^2 t}\,dt =$$

$$= a\int_0^{2\pi} \sqrt{2 - 2\cos t}\,dt = a\int_0^{2\pi} \sqrt{4\sin^2\frac{t}{2}}\,dt = 2a\int_0^{2\pi} \sin\frac{t}{2}\,dt = 2a\left[-2\cos\frac{t}{2}\right]_0^{2\pi} = 8a,$$

i.e. the length of arc of one branch of a cycloid is four times the diameter
of the rolling circle (eight times its radius $a$);

$$S = \int_0^{2\pi a} y\,dx = \int_0^{2\pi} \psi(t)\,\varphi'(t)\,dt = a^2\int_0^{2\pi} (1 - \cos t)^2\,dt =$$

$$= a^2\int_0^{2\pi} (1 - 2\cos t + \cos^2 t)\,dt = 2\pi a^2 - 2a^2\left[\sin t\right]_0^{2\pi} + a^2\left[\frac{t}{2} + \frac{\sin 2t}{4}\right]_0^{2\pi} = 2\pi a^2 + \pi a^2 = 3\pi a^2,$$

i.e. the area bounded by one branch of a cycloid and the fixed line along which
the circle rolls is equal to three times the area of the rolling circle.

We had to take the (+) sign with $\sqrt{4\sin^2 \frac{t}{2}}$ in finding $l$,
since $\sin\frac{t}{2}$ is positive with $t$ varying from 0 to $2\pi$.
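Both results for the cycloid can be checked by forming the sums from which the integrals arise. The sketch assumes the usual parametric form $x = a(t - \sin t)$, $y = a(1 - \cos t)$, consistent with the integrands above; the function names are ours.

```python
import math

def cycloid_point(a, t):
    """Point of the cycloid x = a (t - sin t), y = a (1 - cos t)."""
    return a * (t - math.sin(t)), a * (1 - math.cos(t))

def polygon_length(a, n):
    """Total length of n chords inscribed in one branch (t from 0 to 2*pi)."""
    pts = [cycloid_point(a, 2 * math.pi * i / n) for i in range(n + 1)]
    return sum(math.hypot(pts[i][0] - pts[i - 1][0], pts[i][1] - pts[i - 1][1])
               for i in range(1, n + 1))

def area_under_branch(a, n):
    """Midpoint sum for the integral of y(t) x'(t) dt = a^2 (1 - cos t)^2 dt."""
    h = 2 * math.pi / n
    return sum((a * (1 - math.cos((k + 0.5) * h))) ** 2 * h for k in range(n))

a = 1.0
print(polygon_length(a, 2000), 8 * a)
print(area_under_branch(a, 2000), 3 * math.pi * a * a)
```

The chord sum approaches $8a$ from below as the number of chords grows.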

6. The cardioid of [84] is symmetrical with respect to the polar axis
(Fig. 111), and its length $l$ can therefore be found by doubling the length of
arc for $\theta$ varying from 0 to $\pi$:

$$l = 2\int_0^{\pi} \sqrt{r^2 + (r')^2}\,d\theta = 2\int_0^{\pi} \sqrt{4a^2(1 + \cos\theta)^2 + 4a^2\sin^2\theta}\,d\theta =$$

$$= 8a\int_0^{\pi} \cos\frac{\theta}{2}\,d\theta = 8a\left[2\sin\frac{\theta}{2}\right]_0^{\pi} = 16a,$$

i.e. the length of arc of a cardioid is eight times the diameter of the rolling
(or fixed) circle.

104. Calculation of the volumes of solids of given cross-section. The
calculation of the volume of a solid also leads to evaluation of a definite
integral, when the area of the cross-section, perpendicular to a
given direction, is known.

We call the volume of the given solid $V$ (Fig. 136) and we assume
that the cross-sectional area of the solid is known in planes perpendicular
to a given direction, which is taken as OX. Each cross-section
is defined by the abscissa $x$ of its point of intersection with OX, so that
the cross-sectional area is a function of $x$, say $S(x)$, which we assume
known.

Further, let $a$, $b$ denote the abscissae of the extreme sections of
the solid. We find $V$ by dividing the solid into elements, by means
of a series of cross-sections starting at $x = a$ and ending at $x = b$;
we take one such element $\Delta V$ between the cross-sections with abscissae
$x$ and $x + \Delta x$. We replace $\Delta V$ by the volume of a right cylinder
of height $\Delta x$ and base equal to the cross-section of the solid at $x$
(Fig. 137). The volume of this cylinder is $S(x)\,\Delta x$, so that the
required volume $V$ will be given approximately by

$$\sum S(x)\,\Delta x,$$

where the summation is over all the elements into which the solid
has been divided. On indefinitely increasing the number of elements,
and with the greatest of the $\Delta x$ tending to zero, the above sum becomes
in the limit a definite integral, accurately expressing $V$; this leads to
the following proposition:

If the cross-section of a solid is known for all the planes lying
perpendicular to a given direction, which we take as the axis OX, the volume
$V$ of the solid is given by:

$$V = \int_a^b S(x)\,dx, \tag{23}$$

where $S(x)$ denotes the area of the cross-section of abscissa $x$, and $a$, $b$
are the abscissae of the extremities of the solid.





Example. To find the volume of a cylindrical ungula, cut from a right
circular cylinder by a plane passing through a diameter of its base
(Fig. 138). We take the diameter AB as OX, with A as origin; we let $r$ denote
the radius of the base of the cylinder, and $\alpha$ the angle between the
cutting plane and the base.

[Fig. 138]

A cross-section perpendicular to AB consists of a right-angled triangle
PQR, the area of which is:

$$S(x) = \frac{1}{2}\,PQ \cdot QR = \frac{1}{2}\tan\alpha \cdot PQ^2.$$

Further, we know from the properties of circles that PQ is the geometric
mean of AP, PB, i.e.

$$PQ^2 = AP \cdot PB = x(2r - x),$$

so that finally,

$$S(x) = \frac{1}{2}\,x(2r - x)\tan\alpha.$$

Using (23), we get for the required volume $V$:

$$V = \int_0^{2r} S(x)\,dx = \frac{1}{2}\tan\alpha\int_0^{2r} x(2r - x)\,dx = \frac{1}{2}\tan\alpha\left(rx^2 - \frac{x^3}{3}\right)\Big|_0^{2r} = \frac{2}{3}\,r^3\tan\alpha = \frac{2}{3}\,r^2 h,$$

where the "height" of the ungula $h = r\tan\alpha$.
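The sum $\sum S(x)\,\Delta x$ leading to (23) can be formed directly for the ungula. In this sketch the function names, the midpoint choice of $x$ in each slice, and the sample values of $r$ and $\alpha$ are our own assumptions.

```python
import math

def volume_by_slices(S, a, b, n=1000):
    """Sum of S(x) * dx over n equal slices of (a, b), x taken at each midpoint."""
    dx = (b - a) / n
    return sum(S(a + (k + 0.5) * dx) * dx for k in range(n))

def ungula_section(r, alpha):
    """Cross-sectional area S(x) = (1/2) x (2r - x) tan(alpha) of the ungula."""
    return lambda x: 0.5 * x * (2 * r - x) * math.tan(alpha)

r, alpha = 2.0, math.pi / 6
h = r * math.tan(alpha)                          # "height" of the ungula
approx = volume_by_slices(ungula_section(r, alpha), 0.0, 2 * r)
print(approx, 2.0 / 3.0 * r * r * h)
```

The same `volume_by_slices` helper applies to any solid whose cross-sectional area is known as a function of $x$.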




105. Volume of a solid of revolution. When the solid in question is
obtained by revolution of a given curve $y = f(x)$ about OX, its
cross-section is a circle of radius $y$ (Fig. 139), and hence

$$S(x) = \pi y^2, \qquad V = \int_a^b \pi y^2\,dx,$$

i.e. the volume of the solid, obtained by revolution about OX of the part
of the curve

$$y = f(x)$$

[Fig. 139] [Fig. 140]

included between the ordinates $x = a$ and $x = b$, is given by:

$$V = \int_a^b \pi y^2\,dx. \tag{24}$$


Example. Volume of the ellipsoid of revolution. We get by revolution of
the ellipse

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$$

about its major axis the prolate ellipsoid of revolution (see Fig. 140). The
extreme abscissae $x$ are $(-a)$ and $(+a)$ in this case, so that (24) gives:

$$V_{\text{prol}} = \pi\int_{-a}^{+a} y^2\,dx = \pi\int_{-a}^{+a} b^2\left(1 - \frac{x^2}{a^2}\right)dx = \pi b^2\left(x - \frac{x^3}{3a^2}\right)\Big|_{-a}^{+a} = \frac{4}{3}\,\pi ab^2. \tag{25}$$

The volume of the oblate ellipsoid of revolution, obtained by revolution of
the ellipse about its minor axis, is found in the same way. It is only necessary
to interchange $x$, $y$ and $a$, $b$, which gives:

$$V_{\text{obl}} = \pi\int_{-b}^{+b} x^2\,dy = \pi\int_{-b}^{+b} a^2\left(1 - \frac{y^2}{b^2}\right)dy = \frac{4}{3}\,\pi a^2 b. \tag{26}$$

When $a = b$, both ellipsoids reduce to a sphere of radius $a$, with volume
$\frac{4}{3}\,\pi a^3$.
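Formulas (25) and (26) can be compared with a direct sum of thin discs. The sketch below is illustrative: the function name, the midpoint sampling and the semi-axes 3 and 2 are assumptions of ours.

```python
import math

def revolution_volume(radius, lo, hi, n=2000):
    """Midpoint sum for the integral of pi * radius(u)^2 du over (lo, hi)."""
    du = (hi - lo) / n
    return sum(math.pi * radius(lo + (k + 0.5) * du) ** 2 * du for k in range(n))

a, b = 3.0, 2.0   # semi-axes of the ellipse, a > b

# Revolution about the major axis: disc radius y = b * sqrt(1 - x^2/a^2).
prolate = revolution_volume(lambda x: b * math.sqrt(1 - (x / a) ** 2), -a, a)
# Revolution about the minor axis: disc radius x = a * sqrt(1 - y^2/b^2).
oblate = revolution_volume(lambda y: a * math.sqrt(1 - (y / b) ** 2), -b, b)

print(prolate, 4.0 / 3.0 * math.pi * a * b * b)
print(oblate, 4.0 / 3.0 * math.pi * a * a * b)
```

With $a > b$ the oblate ellipsoid has the larger volume, in the ratio $a : b$.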


106. Surface area of a solid of revolution. When a solid is obtained
by the revolution of a curve, given in the XOY plane, about the axis OX,
its surface area is defined as the limit of the surface area of a second solid,
the second solid being obtained by revolution of a successive series of chords,
inscribed in the original curve, also about OX, the limit being taken when
the number of chords increases indefinitely, and the greatest of them
tends to zero (Fig. 141).

[Fig. 141]

If the part of the rotating curve lies between A and B, the surface
area $F$ is given for the solid of revolution by:

$$F = \int_{(A)}^{(B)} 2\pi y\,ds, \tag{27}$$

where $ds$ is the differential of the arc of the given curve, i.e.

$$ds = \sqrt{(dx)^2 + (dy)^2}.$$

The curve may be given in any convenient form in the above formula,
either explicitly or parametrically; symbols (A), (B) indicate
that integration is between the limits of the independent variable
corresponding to the given points A, B of the curve.

We shall assume that the equation of the curve is given parametrically,
with the length of arc s as parameter, measured from the point A;
let l denote the total length of curve AB. The curve is of course
assumed to be rectifiable. We have: $x = \varphi(s)$, $y = \psi(s)$. We divide the
interval (0, l) of variation of s into sub-intervals in the usual way, with

$$0 = s_0 < s_1 < s_2 < \dots < s_{n-1} < s_n = l.$$

Let $s = s_i$ correspond to the point $M_i$ of the curve, with $M_0$ evidently
coinciding with A, and $M_n$ with B. Let $q_i$ denote the length of the chord
$M_{i-1}M_i$, and $\Delta s_i$ the length of the arc $M_{i-1}M_i$, and let $y_i = \psi(s_i)$.
Using the formula for the lateral surface of the frustum of a cone,
we have the following result for the surface area obtained by revolution
of the series of chords $AM_1M_2 \dots M_{n-1}B$:


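$$Q = 2\pi \sum_{i=1}^{n} \frac{y_{i-1} + y_i}{2}\, q_i$$

or

$$Q = 2\pi \sum_{i=1}^{n} y_{i-1}\, q_i + \pi \sum_{i=1}^{n} (y_i - y_{i-1})\, q_i.$$

Let $\delta$ be the greatest of the $|y_i - y_{i-1}|$. By the uniform continuity of
$\psi(s)$ in $0 \le s \le l$, $\delta$ tends to zero if the greatest value of $(s_i - s_{i-1})$
tends to zero. But we have:

$$\left|\sum_{i=1}^{n} (y_i - y_{i-1})\, q_i\right| \le \delta \sum_{i=1}^{n} q_i \le \delta l,$$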


whence it follows that the second term in the expression for Q tends
to zero. We consider the first term by rewriting it as:

$$2\pi \sum_{i=1}^{n} y_{i-1}\, q_i = 2\pi \sum_{i=1}^{n} y_{i-1}\, \Delta s_i - 2\pi \sum_{i=1}^{n} y_{i-1} (\Delta s_i - q_i).$$

We show that the second term on the right-hand side tends to zero,
by remarking that $\psi(s)$, continuous in (0, l), is bounded, so that there
exists a positive m, such that $|y_{i-1}| < m$ for all i. Hence:

$$\left|\sum_{i=1}^{n} y_{i-1} (\Delta s_i - q_i)\right| \le \sum_{i=1}^{n} m\,(\Delta s_i - q_i) = m\left[l - \sum_{i=1}^{n} q_i\right].$$

But if the greatest of the $(s_i - s_{i-1})$ values tends to zero, the greatest
of the chords $q_i$ also tends to zero, and the total length of the chords
tends to the length of the arc:

$$\sum_{i=1}^{n} q_i \to l,$$

whence

$$2\pi \sum_{i=1}^{n} y_{i-1} (\Delta s_i - q_i) \to 0.$$


The only remaining term to be investigated in the expression for Q
is thus:

$$2\pi \sum_{i=1}^{n} y_{i-1}\, \Delta s_i = 2\pi \sum_{i=1}^{n} \psi(s_{i-1})\,(s_i - s_{i-1}).$$

But the limit of this sum gives us integral (27). Our formula has thus
been proved. If the curve is given in terms of a parameter t, we
have [cf. 103]:

$$F = \int_{\alpha}^{\beta} 2\pi\, \psi(t) \sqrt{\varphi'^2(t) + \psi'^2(t)}\; dt \qquad (28_1)$$

and in the case of the explicit equation y = f(x) for AB:

$$F = \int_{a}^{b} 2\pi f(x) \sqrt{1 + f'^2(x)}\; dx. \qquad (28_2)$$
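The limiting process used in this proof can be tried numerically. The following is only a sketch under stated assumptions: the function name `frustum_surface`, the equal subdivision in x (rather than in the arc length s) and the choice of a semicircle as test curve are illustrative and are not taken from the text.

```python
import math

def frustum_surface(f, a, b, n):
    """Area swept by n chords inscribed in y = f(x), a <= x <= b, revolved about OX.

    Each chord generates a frustum of a cone with lateral area
    pi * (y_{i-1} + y_i) * q_i, where q_i is the chord length.
    """
    total = 0.0
    for i in range(1, n + 1):
        x0 = a + (b - a) * (i - 1) / n
        x1 = a + (b - a) * i / n
        y0, y1 = f(x0), f(x1)
        q = math.hypot(x1 - x0, y1 - y0)   # chord length q_i
        total += math.pi * (y0 + y1) * q   # lateral area of one frustum
    return total

radius = 1.0  # assumed test value

def semicircle(x):
    # Upper half of the circle x^2 + y^2 = radius^2; max() guards rounding at the ends.
    return math.sqrt(max(radius * radius - x * x, 0.0))

for n in (10, 100, 1000):
    print(n, frustum_surface(semicircle, -radius, radius, n))
print("4*pi*r^2 =", 4 * math.pi * radius ** 2)
```

The printed sums should approach the surface area of the sphere from below as the number of chords grows.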


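Example. Surface area of prolate and oblate ellipsoid of revolution.
We take the prolate case first. Using the notation of the example of [105],
we have by $(28_2)$:

$$F_{pro} = 2\pi \int_{-a}^{+a} y \sqrt{1 + y'^2}\; dx = 2\pi \int_{-a}^{+a} \sqrt{y^2 + (yy')^2}\; dx.$$

We have from the equation of the ellipse:

$$y^2 = b^2\left(1 - \frac{x^2}{a^2}\right), \qquad yy' = -\frac{b^2 x}{a^2},$$

whence

$$(yy')^2 = \frac{b^4 x^2}{a^4},$$

$$F_{pro} = 2\pi \int_{-a}^{+a} \sqrt{b^2 - \frac{b^2}{a^2}x^2 + \frac{b^4}{a^4}x^2}\; dx = 2\pi b \int_{-a}^{+a} \sqrt{1 - \frac{a^2 - b^2}{a^4}\,x^2}\; dx.$$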

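We introduce here the eccentricity of the ellipse

$$\varepsilon^2 = \frac{a^2 - b^2}{a^2},$$

and get (cf. Example [99]):

$$F_{pro} = 2\pi b \int_{-a}^{+a} \sqrt{1 - \frac{\varepsilon^2 x^2}{a^2}}\; dx = \frac{4\pi ab}{\varepsilon} \int_{0}^{a} \sqrt{1 - \left(\frac{\varepsilon x}{a}\right)^2}\; d\!\left(\frac{\varepsilon x}{a}\right) = \frac{4\pi ab}{\varepsilon} \int_{0}^{\varepsilon} \sqrt{1 - t^2}\; dt.$$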
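Integrating by parts, we have (cf. Example 11 [92]):

$$\int \sqrt{1 - t^2}\; dt = \frac{1}{2}\left[t\sqrt{1 - t^2} + \arcsin t\right],$$

whence

$$\int_{0}^{\varepsilon} \sqrt{1 - t^2}\; dt = \frac{1}{2}\left[\varepsilon\sqrt{1 - \varepsilon^2} + \arcsin \varepsilon\right],$$

and finally

$$F_{pro} = 2\pi ab \left[\sqrt{1 - \varepsilon^2} + \frac{\arcsin \varepsilon}{\varepsilon}\right]. \qquad (29)$$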


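This formula is also valid in the limit for $\varepsilon = 0$, i.e. when b = a, and the
ellipsoid reduces to a sphere of radius a. The square brackets contain an indeterminate
form in this case, which may be evaluated as [65]:

$$\left.\frac{\arcsin \varepsilon}{\varepsilon}\right|_{\varepsilon = 0} = \left.\frac{1}{\sqrt{1 - \varepsilon^2}}\right|_{\varepsilon = 0} = 1.$$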
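We now turn to the oblate ellipsoid of revolution. We interchange x, y,
and a, b, and find:

$$F_{ob} = 2\pi \int_{-b}^{+b} \sqrt{x^2 + (xx')^2}\; dy,$$

where x is taken as a function of y.
But we have from the equation of the ellipse:

$$x^2 = a^2\left(1 - \frac{y^2}{b^2}\right), \qquad xx' = -\frac{a^2 y}{b^2}, \qquad (xx')^2 = \frac{a^4 y^2}{b^4},$$

whence

$$F_{ob} = 2\pi \int_{-b}^{+b} \sqrt{a^2 + \frac{a^2}{b^2}\left(\frac{a^2}{b^2} - 1\right) y^2}\; dy = 4\pi a \int_{0}^{b} \sqrt{1 + \frac{a^2\varepsilon^2}{b^4}\,y^2}\; dy = \frac{4\pi b^2}{\varepsilon} \int_{0}^{a\varepsilon/b} \sqrt{1 + t^2}\; dt =$$

$$= \frac{2\pi b^2}{\varepsilon} \left[t\sqrt{1 + t^2} + \log\left(t + \sqrt{1 + t^2}\right)\right]_{0}^{a\varepsilon/b} = \frac{2\pi b^2}{\varepsilon}\left[\frac{a^2 \varepsilon}{b^2} + \log \frac{a(1 + \varepsilon)}{b}\right],$$

and finally, since $a^2/b^2 = 1/(1 - \varepsilon^2)$:

$$F_{ob} = 2\pi a^2 + \frac{\pi b^2}{\varepsilon} \log \frac{1 + \varepsilon}{1 - \varepsilon}. \qquad (30)$$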
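The two closed forms can be compared with a direct numerical evaluation of the integrals they come from. This is a hedged check, not part of the text: the helper `midpoint`, the function names and the semi-axes a = 2, b = 1 are assumptions chosen for illustration, and a > b is required so that the eccentricity is positive.

```python
import math

def prolate_area(a, b):
    """Closed form for revolution about the major axis (assumes a > b > 0)."""
    eps = math.sqrt(a * a - b * b) / a
    return 2 * math.pi * a * b * (math.sqrt(1 - eps * eps) + math.asin(eps) / eps)

def oblate_area(a, b):
    """Closed form for revolution about the minor axis (assumes a > b > 0)."""
    eps = math.sqrt(a * a - b * b) / a
    return 2 * math.pi * a * a + math.pi * b * b / eps * math.log((1 + eps) / (1 - eps))

def midpoint(g, lo, hi, n=20000):
    """Plain midpoint-rule quadrature, used only as an independent check."""
    h = (hi - lo) / n
    return h * sum(g(lo + (i + 0.5) * h) for i in range(n))

a, b = 2.0, 1.0  # assumed semi-axes

# 2*pi * integral of sqrt(y^2 + (y y')^2) dx over (-a, a)
pro_numeric = 2 * math.pi * midpoint(
    lambda x: math.sqrt(b * b * (1 - x * x / a ** 2) + (b * b * x / a ** 2) ** 2), -a, a)

# 2*pi * integral of sqrt(x^2 + (x x')^2) dy over (-b, b)
ob_numeric = 2 * math.pi * midpoint(
    lambda y: math.sqrt(a * a * (1 - y * y / b ** 2) + (a * a * y / b ** 2) ** 2), -b, b)

print(pro_numeric, prolate_area(a, b))
print(ob_numeric, oblate_area(a, b))
```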


107. Determination of centre of gravity. Guldin's theorem. Given a
system of n point-masses at

$$M_1(x_1, y_1),\; M_2(x_2, y_2),\; \dots,\; M_n(x_n, y_n),$$

with respective masses

$$m_1,\; m_2,\; \dots,\; m_n,$$

the centre of gravity G of the system is defined as the point whose
coordinates $x_G$, $y_G$ satisfy the conditions:

$$M x_G = \sum_{i=1}^{n} m_i x_i, \qquad M y_G = \sum_{i=1}^{n} m_i y_i, \qquad (31)$$

where M denotes the total mass of the system:

$$M = \sum_{i=1}^{n} m_i.$$


The points of a system may be arranged in any desired manner
when finding its centre of gravity G; the aim is to group the points
in such a way that the points of any one group may be replaced by a
single point at the centre of gravity of the group, the mass at the point
being the total mass of the group.

We shall not dwell on the proof of this general principle, since it presents
no difficulty and can easily be verified for the simplest particular cases
of systems of three, four points etc.

Fig. 142
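The grouping principle is easy to verify on a small system. In the sketch below the four point-masses and the function name `centre_of_gravity` are assumed purely for illustration; the function applies (31) and returns a triple that can itself be treated as a point-mass.

```python
def centre_of_gravity(points):
    """points: list of (mass, x, y). Returns (total mass M, x_G, y_G) from (31)."""
    total = sum(m for m, _, _ in points)
    x_g = sum(m * x for m, x, _ in points) / total
    y_g = sum(m * y for m, _, y in points) / total
    return total, x_g, y_g

# Assumed example system of four point-masses (mass, x, y).
system = [(1.0, 0.0, 0.0), (2.0, 3.0, 0.0), (3.0, 0.0, 4.0), (4.0, 1.0, 1.0)]

direct = centre_of_gravity(system)

# Replace each group by a single point at its own centre of gravity,
# carrying the total mass of the group, then combine the two points.
group_a = centre_of_gravity(system[:2])
group_b = centre_of_gravity(system[2:])
grouped = centre_of_gravity([group_a, group_b])

print(direct)
print(grouped)
```

Both computations give the same point, whatever the split into groups.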

We shall be concerned in future, not with systems of points, but 
with the continuous distribution of mass over a plane figure (domain) 
or on a line. 

For simplicity, we limit our consideration to homogeneous solids, 
the density of which we take as unity, so that the mass is equal to 
the length in the case of linear distributions, and to the area for plane 
distributions. 

Suppose first that it is required to find the centre of gravity of an
arc AB of a curve (Fig. 142), the length of which is s. Following the
above general principle, we divide arc AB into n small elements $\Delta s$.
The centre of gravity of the total system can be found by replacing
each of these elements with its centre of gravity, at which is concentrated
the total mass of the element $\Delta m = \Delta s$.†
We take an element $\Delta s$, and let the coordinates of its ends be
$(x, y)$, $(x + \Delta x, y + \Delta y)$; the coordinates of its centre of gravity
are denoted by $(\bar{x}, \bar{y})$. We can suppose that the point $(\bar{x}, \bar{y})$ differs by
as little as may be desired from the point (x, y) on sufficiently diminishing $\Delta s$.
We have by (31), as in [104]:

$$M x_G = s\,x_G = \sum \bar{x}\,\Delta m = \sum \bar{x}\,\Delta s = \lim \sum x\,\Delta s = \int_{(A)}^{(B)} x\,ds, \qquad (32)$$

$$M y_G = s\,y_G = \sum \bar{y}\,\Delta m = \sum \bar{y}\,\Delta s = \lim \sum y\,\Delta s = \int_{(A)}^{(B)} y\,ds, \qquad (33)$$

whence, on obtaining s from the formula:

$$s = \int_{(A)}^{(B)} ds = \int_{(A)}^{(B)} \sqrt{(dx)^2 + (dy)^2},$$

we find the coordinates of the centre of gravity G.

An important theorem follows from (32) and (33): 

GULDIN'S FIRST THEOREM. The surface area of a solid, obtained by 
revolution of the arc of a given plane curve about some non-intersecting 
axis in the same plane, is equal to the product of the length of arc and 
the path described on revolution by the centre of gravity of the arc. 

Taking the axis of revolution as OX, we have for the surface area F
of the solid obtained by revolution of arc AB (using (27) [106]):

$$F = 2\pi \int_{(A)}^{(B)} y\,ds = 2\pi y_G \cdot s$$

[by (33)], which it was required to prove.

We now take a plane domain S, its area being also denoted by S.
We suppose for simplicity that the domain (Fig. 143) is bounded by
two curves, the ordinates of which are denoted by

$$y_1 = f_1(x), \qquad y_2 = f_2(x).$$

† The centre of gravity of an element does not in general lie on the curve,
though it approaches closer to the curve, the smaller the element. This is
indicated schematically in Fig. 142.

In accordance with the general principle mentioned at the beginning
of this article, we divide the figure into n vertical strips $\Delta S$ by lines
parallel to OY. We can find the coordinates of the centre of gravity G
by replacing each strip by its centre of gravity, associated with its
mass $\Delta m = \Delta S$. We take the strip bounded by $M_1M_2$ and $M_1'M_2'$,
with abscissae x and $x + \Delta x$, and let its centre of gravity be $(\bar{x}, \bar{y})$.

Fig. 143

On sufficiently reducing the breadth $\Delta x$ of the strip, $(\bar{x}, \bar{y})$ will
differ as little as desired from the centre P of $M_1M_2$, so that we can
write the approximate equations:

$$\bar{x} \approx x, \qquad \bar{y} \approx \frac{y_1 + y_2}{2}.$$

Further, the mass $\Delta m$ of the strip is equal to its area $\Delta S$ and so
can be taken as approximately equal to the area of the rectangle of
base $\Delta x$ and height differing by as little as desired from $M_1M_2 = y_2 - y_1$, i.e.

$$\Delta m \approx (y_2 - y_1)\,\Delta x.$$

Using (31), we can write:

$$M x_G = S\,x_G = \sum \bar{x}\,\Delta m = \lim \sum \left[x\,(y_2 - y_1)\right] \Delta x = \int_{a}^{b} x\,(y_2 - y_1)\,dx, \qquad (34)$$

$$M y_G = S\,y_G = \sum \bar{y}\,\Delta m = \lim \sum \frac{y_1 + y_2}{2}\,(y_2 - y_1)\,\Delta x = \lim \sum \left[\frac{1}{2}\left(y_2^2 - y_1^2\right)\right] \Delta x = \int_{a}^{b} \frac{1}{2}\left(y_2^2 - y_1^2\right) dx. \qquad (35)$$


GULDIN’S SECOND THEOREM. This follows from (35): 

When a plane figure revolves about an axis in its plane which does 
not cut the figure, the volume of the solid obtained is equal to the product 
of the area of the figure and the length of path resulting from the revo- 
lution of the centre of gravity of the figure. 

We take the axis of revolution as OX, and notice that the volume V
of the solid of revolution in question is equal to the difference
between the volumes of the solids obtained by revolution of the
curves $y_2$ and $y_1$, so that, in accordance with (24) [105]:

$$V = \pi \int_{a}^{b} y_2^2\,dx - \pi \int_{a}^{b} y_1^2\,dx = \pi \int_{a}^{b} \left(y_2^2 - y_1^2\right) dx = 2\pi y_G \cdot S,$$

by (35), which is what we required to prove.

Fig. 144


The above two theorems of Guldin are extremely useful, either for
finding the surface area or volume of a solid of revolution, when the
centre of gravity of the revolving figure is known, or conversely for
finding the centre of gravity of a figure when the volume or surface
area of the solid obtained by its revolution is known.

Examples. 1. To find the volume V of the anchor ring (torus) obtained by
revolution of a circle of radius r (Fig. 144) about an axis in its plane, distant a
from the centre (with r < a, i.e. the axis of revolution does not cut the circle).

The centre of gravity of the revolving circle evidently lies at its centre, so that
the path described by revolution of the centre of gravity is equal to $2\pi a$.
The area of the revolving figure is $\pi r^2$, and hence we have by Guldin's
second theorem:

$$V = \pi r^2 \cdot 2\pi a = 2\pi^2 a r^2. \qquad (36)$$

2. To find the surface area F of the anchor ring of Example 1.

The length of the revolving circle is $2\pi r$; the centre of gravity coincides
with the centre of the circle, as before, so that we have by Guldin's first
theorem:

$$F = 2\pi r \cdot 2\pi a = 4\pi^2 a r. \qquad (37)$$

Fig. 145


3. To find the centre of gravity G of a semi-circular disc of radius a. We
take the base of the semi-circle as axis OX, with OY upwards through the
centre and perpendicular to OX (Fig. 145); it is clear from the symmetry of
the figure with respect to OY that G lies on OY. It only remains to find $y_G$.
We apply Guldin's second theorem. The solid obtained by revolution of the
semi-circle about OX is a sphere of radius a, and its volume is $\frac{4}{3}\pi a^3$. The
area S of the revolving figure is $\frac{1}{2}\pi a^2$, and hence:

$$\frac{4}{3}\pi a^3 = \frac{1}{2}\pi a^2 \cdot 2\pi y_G, \qquad y_G = \frac{4a}{3\pi}.$$

4. To find the centre of gravity G' of a semi-circular arc of radius a.

We take the same axes as in the previous example, and see that G' also
lies on OY, so that we have to find $y_{G'}$. We apply Guldin's first theorem, and
note that the surface area F of the solid of revolution is $4\pi a^2$ in this case,
whilst $s = \pi a$, so that:

$$4\pi a^2 = \pi a \cdot 2\pi y_{G'}, \qquad y_{G'} = \frac{2a}{\pi}.$$


As might be expected, the centre of gravity of the semi-circular arc lies 
closer to it than the centre of gravity of the semi-circular disc bounded 
by it. 
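The results of Examples 3 and 4 can be cross-checked against the integral formulas (35) and (33) directly. The following is a numerical sketch only; the radius value, the helper `midpoint` and the variable names are assumptions made for the illustration.

```python
import math

def midpoint(g, lo, hi, n=20000):
    """Midpoint-rule quadrature used as a simple numerical integrator."""
    h = (hi - lo) / n
    return h * sum(g(lo + (i + 0.5) * h) for i in range(n))

a = 1.5  # assumed radius

# Semi-circular disc: y_1 = 0, y_2 = sqrt(a^2 - x^2), so (35) gives
# S * y_G = integral of (1/2) * (a^2 - x^2) dx over (-a, a).
disc_area = 0.5 * math.pi * a * a
y_disc = midpoint(lambda x: 0.5 * (a * a - x * x), -a, a) / disc_area

# Semi-circular arc: x = a cos(t), y = a sin(t), ds = a dt, 0 <= t <= pi,
# so (33) gives s * y_G' = integral of a sin(t) * a dt.
arc_length = math.pi * a
y_arc = midpoint(lambda t: a * math.sin(t) * a, 0.0, math.pi) / arc_length

print(y_disc, 4 * a / (3 * math.pi))
print(y_arc, 2 * a / math.pi)

# Guldin's second theorem recovers the volume of the sphere.
print(2 * math.pi * y_disc * disc_area, 4.0 / 3.0 * math.pi * a ** 3)
```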


108. Approximate evaluation of definite integrals; rectangle and trapezoid 
formulae. It is not always possible to evaluate definite integrals with the 
aid of the primitive on the basis of (15) [96], since the primitive cannot 
always be found in practice, even though it exists, given the continuity 
of the integrand; furthermore, even when it can be found, its form is often 
too complicated for handy computation. This is the reason for the import- 
ance of methods of approximate evaluation of definite integrals. 

Most methods are based on the interpretation of a definite integral as
an area or as the limit of a sum:

$$\int_{a}^{b} f(x)\,dx = \lim \sum_{i=1}^{n} f(\xi_i)\,(x_i - x_{i-1}). \qquad (38)$$

From now on, we agree always to divide the interval (a, b) into n equal
parts; we let h denote the length of each part, so that

$$h = \frac{b - a}{n}, \qquad x_i = a + ih \quad (x_0 = a;\; x_n = a + nh = b).$$

We also let $y_i$ denote the value of the integrand y = f(x) for $x = x_i$
(i = 0, 1, ..., n):

$$y_i = f(x_i) = f(a + ih). \qquad (39)$$

We take these values as known; they can be found by direct computation,
if f(x) is given analytically, or from the graph, if it is given graphically.
Substituting

$$\xi_i = x_{i-1} \quad \text{or} \quad \xi_i = x_i$$

in the sum on the right-hand side of (38), we get two approximate rectangle
formulae:

$$\int_{a}^{b} f(x)\,dx \approx \frac{b - a}{n}\left[y_0 + y_1 + \dots + y_{n-1}\right], \qquad (40)$$

$$\int_{a}^{b} f(x)\,dx \approx \frac{b - a}{n}\left[y_1 + y_2 + \dots + y_n\right], \qquad (41)$$

where the sign ($\approx$) denotes approximate equality.

The greater n, i.e. the smaller h, the more accurate these formulae;
in the limit, with $n \to \infty$ and $h \to 0$, they give the exact
value of the definite integral. Thus, as the number of ordinates
increases, the errors of (40) and (41) tend to zero. The upper limit
of the error, for a given number of ordinates, is particularly easy
to determine for the case of f(x) monotonic in (a, b) (Fig. 146). It is
immediately evident from the figure that here, the error of each
of formulae (40) and (41) does not exceed the sum of the areas of the
shaded rectangles, i.e. does not exceed the area of the rectangle
with the same base (b − a)/n = h and with a height equal to the sum
of the heights of the shaded rectangles $y_n - y_0$, i.e. the amount:

$$\frac{b - a}{n}\,(y_n - y_0). \qquad (42)$$

Fig. 146

Fig. 147


The rectangle formula replaces the accurate expression for the
area under the curve y = f(x) with an approximation for it, namely the area
under the step-line, consisting of horizontal and vertical sections, bounding
the rectangles.
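The two rectangle formulae and the error bound (42) for a monotonic integrand can be illustrated as follows. This is a sketch with assumed names (`rectangle_formulae`) and an assumed test integrand, the exponential function on (0, 1), which is increasing, so that (40) gives the lower and (41) the higher value.

```python
import math

def rectangle_formulae(f, a, b, n):
    """Return the values of (40), (41) and the bound (42) for n equal parts."""
    h = (b - a) / n
    ys = [f(a + i * h) for i in range(n + 1)]   # y_0 ... y_n
    left = h * sum(ys[:-1])                     # (40): y_0 + ... + y_{n-1}
    right = h * sum(ys[1:])                     # (41): y_1 + ... + y_n
    bound = h * abs(ys[-1] - ys[0])             # (42), valid for monotonic f
    return left, right, bound

exact = math.e - 1.0                            # integral of exp over (0, 1)
left, right, bound = rectangle_formulae(math.exp, 0.0, 1.0, 50)
print(left, right, bound)
print(exact - left, right - exact)
```

For a monotonic function the exact value lies between the two rectangle sums, whose difference is the amount (42).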

We get different approximations by taking other lines, differing by a suf- 
ficiently small amount from the given curve, in place of the step-line; the 
closer the auxiliary line comes to the curve y = f(x), the smaller the error 
in taking the area under the auxiliary line in place of the area under the curve. 

For instance, if we replace the given curve with a series of successive
chords with their ends at $x = x_i$ (Fig. 147), so that the required area is replaced
by the sum of the areas of the corresponding trapezoids, we get the approximate
trapezoid formula:

$$\int_{a}^{b} f(x)\,dx \approx \frac{b - a}{2n}\left[y_0 + 2y_1 + 2y_2 + \dots + 2y_{n-1} + y_n\right]. \qquad (43)$$

109. Tangent formula, and Poncelet's formula. We now double the number
of divisions, by halving each sub-interval. We thus get 2n divisions (Fig. 148):

$$x_0 = a,\; x_{1/2} = a + \tfrac{1}{2}h,\; x_1 = a + h,\; \dots,\; x_i = a + ih,\; x_{i+1/2} = a + \left(i + \tfrac{1}{2}\right)h,\; \dots,\; x_n = b,$$

with corresponding ordinates:

$$y_0,\; y_{1/2},\; y_1,\; \dots,\; y_i,\; y_{i+1/2},\; \dots,\; y_n$$

(we shall refer to $y_0, y_1, \dots, y_n$ as even ordinates, and to
$y_{1/2}, y_{3/2}, \dots, y_{n-1/2}$ as odd).

Fig. 148


We draw the tangent at the end of each odd ordinate to cut the two
neighbouring even ordinates, and we replace the given area with the sum
of the areas of the trapezoids thus constructed. The approximation obtained
is referred to as the tangent formula:

$$\int_{a}^{b} f(x)\,dx \approx \frac{b - a}{n}\left[y_{1/2} + y_{3/2} + \dots + y_{n-1/2}\right] = \sigma_1. \qquad (44)$$

We consider, together with the above circumscribed trapezoids, the inscribed
trapezoids, obtained by joining the ends of neighbouring odd ordinates with chords;
we add to these the two extreme trapezoids, formed by the chords that join the
ends of ordinates $y_0$ and $y_{1/2}$, $y_{n-1/2}$ and $y_n$. We denote the sum of the areas
of these trapezoids by

$$\sigma_2 = \frac{b - a}{2n}\left[\frac{y_0 + y_n}{2} - \frac{y_{1/2} + y_{n-1/2}}{2} + 2y_{1/2} + 2y_{3/2} + \dots + 2y_{n-1/2}\right].$$

If the curve y = f(x) has no point of inflexion in the interval (a, b), i.e. is
only convex or only concave, the area S of the curve lies between areas $\sigma_1$
and $\sigma_2$, so that their arithmetic mean $\frac{1}{2}(\sigma_1 + \sigma_2)$ is naturally taken as an
approximation for S. This gives us Poncelet's formula:

$$\int_{a}^{b} f(x)\,dx \approx \frac{b - a}{2n}\left[\frac{y_0 + y_n}{4} - \frac{y_{1/2} + y_{n-1/2}}{4} + 2y_{1/2} + 2y_{3/2} + \dots + 2y_{n-1/2}\right]. \qquad (45)$$

It is easily seen that, with the assumptions made about the form of the
curve, this formula gives an error with an absolute value not exceeding

$$\frac{1}{2}\left|\sigma_1 - \sigma_2\right| = \frac{b - a}{4n}\left|\frac{y_{1/2} + y_{n-1/2}}{2} - \frac{y_0 + y_n}{2}\right|, \qquad (46)$$

where it can easily be shown, by considering the bisector of a trapezoid,
that the expression between the bars is equal to the segment of the central ordinate
cut off by the chords joining the ends of the extreme odd ordinates, and the
ends of the extreme even ordinates.


110. Simpson's formula. Still using the previous division into an even
number of parts, we replace the given curve by a series of arcs of second
degree parabolas, drawn through the ends of each group of three ordinates:

$$y_0, y_{1/2}, y_1;\quad y_1, y_{3/2}, y_2;\quad \dots;\quad y_{n-1}, y_{n-1/2}, y_n.$$

We use (4) [101] to find the area of each of the curvilinear figures thus
obtained, and arrive at Simpson's approximate formula:

$$\int_{a}^{b} f(x)\,dx \approx \frac{b - a}{6n}\left[y_0 + 4y_{1/2} + 2y_1 + 4y_{3/2} + 2y_2 + \dots + 2y_{n-1} + 4y_{n-1/2} + y_n\right]. \qquad (47)$$


We shall not dwell on deducing the error of this formula, or that of the 
trapezoid formula. It may be remarked generally, that expressing the error 
in terms of a definite formula is of theoretical rather than practical interest, 
since it usually gives too crude a limit. 

We note in regard to the above construction that, with suitable choice of
a, b, and c, a parabola $y = ax^2 + bx + c$ can always be made to pass through
three points of a plane with different abscissae.

Smoothness of the curve is vital for the accuracy of practical results, and
the computation should be more precise in the vicinity of points where the
curve changes shape fairly rapidly than over intervals of more gradual change.
It is always useful to draw at least a rough sketch of the curve before starting
the computation.

It is essential to make up a table when carrying out numerical computations. 
The following examples are given to show the arrangement of the tables 
and in order to compare the accuracy of the various approximate formulae 
mentioned above: 

$$S = \int_{0}^{\pi/2} \sin x\,dx = 1,$$

(n = 10)   (b − a)/n = 0.157 07963,   (b − a)/2n = 0.078 53981,   (b − a)/6n = 0.026 17994.

Logarithms are to base 10; a bar over the characteristic marks a negative characteristic.

Even ordinates                      Odd ordinates
y_1      sin 9°     0.156 4345      y_{1/2}     sin 4°.5     0.078 4591
y_2      sin 18°    0.309 0170      y_{3/2}     sin 13°.5    0.233 4454
y_3      sin 27°    0.453 9905      y_{5/2}     sin 22°.5    0.382 6834
y_4      sin 36°    0.587 7853      y_{7/2}     sin 31°.5    0.522 4986
y_5      sin 45°    0.707 1068      y_{9/2}     sin 40°.5    0.649 4480
y_6      sin 54°    0.809 0170      y_{11/2}    sin 49°.5    0.760 4060
y_7      sin 63°    0.891 0065      y_{13/2}    sin 58°.5    0.852 6402
y_8      sin 72°    0.951 0565      y_{15/2}    sin 67°.5    0.923 8795
y_9      sin 81°    0.987 6883      y_{17/2}    sin 76°.5    0.972 3699
                                    y_{19/2}    sin 85°.5    0.996 9173
Σ_1                 5.853 1024      Σ_2                      6.372 7474

y_0      sin 0°     0.000 0000
y_10     sin 90°    1.000 0000

LOW VALUE RECTANGLE FORMULA

Σ_1      5.853 1024      log Σ             0.767 3861
y_0      0.000 0000      log (b − a)/n     1̄.196 1198
Σ        5.853 1024      log S             1̄.963 5059
                         S ≈ 0.919 4080

HIGH VALUE RECTANGLE FORMULA

Σ_1      5.853 1024      log Σ             0.835 8873
y_10     1.000 0000      log (b − a)/n     1̄.196 1198
Σ        6.853 1024      log S             0.032 0071
                         S ≈ 1.076 5828

TANGENT FORMULA

log Σ_2            0.804 3267
log (b − a)/n      1̄.196 1198
log S              0.000 4465
S ≈ 1.001 0290

TRAPEZOID FORMULA

2Σ_1          11.706 2048      log Σ              1.104 0158
y_0 + y_10     1.000 0000      log (b − a)/2n     2̄.895 0899
Σ             12.706 2048      log S              1̄.999 1057
                               S ≈ 0.997 9430

PONCELET'S FORMULA

2Σ_2                          12.745 4948      log Σ              1.104 7141
(1/4)(y_0 + y_10)              0.250 0000      log (b − a)/2n     2̄.895 0898
−(1/4)(y_{1/2} + y_{19/2})    −0.268 8441      log S              1̄.999 8039
Σ                             12.726 6507      S ≈ 0.999 5487

SIMPSON'S FORMULA

2Σ_1          11.706 2048      log Σ              1.582 0314
4Σ_2          25.490 9896      log (b − a)/6n     2̄.417 9685
y_0 + y_10     1.000 0000      log S              1̄.999 9999
Σ             38.197 1944      S ≈ 1.000 0000
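The formulae (40), (41), (43), (44), (45) and (47) can be gathered into one routine and run on the integral of sin x over (0, π/2) with n = 10, which allows a comparison with the hand computation tabulated above. This is a minimal sketch: the function name `quadrature_formulae` and the dictionary keys are assumptions, and small differences in the last digits from the tabulated values are to be expected, since those were obtained with logarithm tables.

```python
import math

def quadrature_formulae(f, a, b, n):
    """Evaluate the approximate formulae of [108]-[110] for n equal parts."""
    h = (b - a) / n
    even = [f(a + i * h) for i in range(n + 1)]      # y_0, y_1, ..., y_n
    odd = [f(a + (i + 0.5) * h) for i in range(n)]   # y_{1/2}, ..., y_{n-1/2}
    s1 = sum(even[1:-1])                             # Sigma_1: inner even ordinates
    s2 = sum(odd)                                    # Sigma_2: odd ordinates
    ends = even[0] + even[-1]
    return {
        "rectangle (40)": h * (even[0] + s1),
        "rectangle (41)": h * (s1 + even[-1]),
        "trapezoid (43)": h / 2 * (ends + 2 * s1),
        "tangent (44)": h * s2,
        "poncelet (45)": h / 2 * (ends / 4 - (odd[0] + odd[-1]) / 4 + 2 * s2),
        "simpson (47)": h / 6 * (ends + 2 * s1 + 4 * s2),
    }

sine_results = quadrature_formulae(math.sin, 0.0, math.pi / 2, 10)
for name, value in sine_results.items():
    print(f"{name:16s} {value:.7f}")
```

Since the sine curve has no point of inflexion in the interval, the trapezoid and tangent values lie on opposite sides of the exact value 1, and Poncelet's value lies between them.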





$$S = \int_{0}^{1} \frac{\log(1 + x)}{1 + x^2}\,dx = \frac{\pi}{8}\log 2 = 0.272\,198\,2613\ldots,\dagger$$

(n = 10)   (b − a)/2n = 1/20,   (b − a)/6n = 1/60.

† This formula is derived in Volume II.

Even ordinates           Odd ordinates
y_1      0.094 3665      y_{1/2}     0.048 6685
y_2      0.175 3092      y_{3/2}     0.136 6865
y_3      0.240 7012      y_{5/2}     0.210 0175
y_4      0.290 0623      y_{7/2}     0.267 3538
y_5      0.324 3721      y_{9/2}     0.308 9926
y_6      0.345 5909      y_{11/2}    0.336 4722
y_7      0.356 1263      y_{13/2}    0.352 0389
y_8      0.358 4065      y_{15/2}    0.358 1540
y_9      0.354 6154      y_{17/2}    0.357 1470
                         y_{19/2}    0.351 0273
Σ_1      2.539 5503      Σ_2         2.726 5583

y_0      0.000 0000
y_10     0.346 5736

PONCELET'S FORMULA

2Σ_2                           5.453 1166
(1/4)(y_0 + y_10)              0.086 6434
−(1/4)(y_{1/2} + y_{19/2})    −0.099 9239
Σ                              5.439 8361
S = (1/20) Σ ≈ 0.271 9918

SIMPSON'S FORMULA

2Σ_1           5.079 1006
4Σ_2          10.906 2332
y_0 + y_10     0.346 5736
Σ             16.331 9074
S = (1/60) Σ ≈ 0.272 198 46


$$S = \int_{0}^{1} \frac{dx}{1 + x} = \log 2 = 0.693\,147\,18\ldots$$

(n = 20)   (b − a)/2n = 1/40,   (b − a)/6n = 1/120.

Even ordinates           Odd ordinates
y_1      0.952 3810      y_{1/2}     0.975 6097
y_2      0.909 0909      y_{3/2}     0.930 2326
y_3      0.869 5653      y_{5/2}     0.888 8889
y_4      0.833 3333      y_{7/2}     0.851 0638
y_5      0.800 0000      y_{9/2}     0.816 3266
y_6      0.769 2307      y_{11/2}    0.784 3135
y_7      0.740 7407      y_{13/2}    0.754 7169
y_8      0.714 2857      y_{15/2}    0.727 2727
y_9      0.689 6552      y_{17/2}    0.701 7543
y_10     0.666 6667      y_{19/2}    0.677 9661
y_11     0.645 1613      y_{21/2}    0.655 7377
y_12     0.625 0000      y_{23/2}    0.634 9207
y_13     0.606 0606      y_{25/2}    0.615 3846
y_14     0.588 2353      y_{27/2}    0.597 0149
y_15     0.571 4287      y_{29/2}    0.579 7101
y_16     0.555 5556      y_{31/2}    0.563 3804
y_17     0.540 5405      y_{33/2}    0.547 9451
y_18     0.526 3146      y_{35/2}    0.533 3333
y_19     0.512 8205      y_{37/2}    0.519 4806
                         y_{39/2}    0.506 3291
Σ_1     13.116 0666      Σ_2        13.861 3816

y_0      1.000 0000
y_20     0.500 0000

TRAPEZOID FORMULA

2Σ_1          26.232 1332
y_0 + y_20     1.500 0000
Σ             27.732 1332
S = (1/40) Σ ≈ 0.693 303 33

PONCELET'S FORMULA

2Σ_2                          27.722 7632
(1/4)(y_0 + y_20)              0.375 0000
−(1/4)(y_{1/2} + y_{39/2})    −0.370 4847
Σ                             27.727 2785
S = (1/40) Σ ≈ 0.693 181 96

SIMPSON'S FORMULA

2Σ_1          26.232 1332
4Σ_2          55.445 5264
y_0 + y_20     1.500 0000
Σ             83.177 6596
S = (1/120) Σ ≈ 0.693 147 16

111. Evaluation of definite integrals with variable upper limits. The evaluation
of a definite integral with variable upper limit,

$$F(x) = \int_{a}^{x} f(t)\,dt,$$

is required in numerous problems.

Taking the trapezoid formula (43) as basis, the following method can be
indicated for finding the approximate value of this integral, not for all x,
of course, but for those by which the interval (a, b) is subdivided, i.e. we
find:

$$F(a),\; F(x_1),\; F(x_2),\; \dots,\; F(x_k),\; \dots,\; F(x_{n-1}),\; F(b).$$


We have by (43):

$$F(x_k) = \int_{a}^{a + kh} f(x)\,dx \approx h\left[\frac{y_0}{2} + y_1 + y_2 + \dots + y_{k-1} + \frac{y_k}{2}\right], \qquad (48)$$

$$F(x_{k+1}) = \int_{a}^{a + (k+1)h} f(x)\,dx \approx h\left[\frac{y_0}{2} + y_1 + \dots + y_{k-1} + y_k + \frac{y_{k+1}}{2}\right] = F(x_k) + \frac{h}{2}\,(y_k + y_{k+1}). \qquad (49)$$

Having found $F(x_k)$, this formula enables us to pass to the next value
$F(x_{k+1}) = F(x_k + h)$.
This computation can be set out as shown in the following table.

I    II        III    IV                   V                                     VI
k    x         y_k    s_k = y_{k-1} + y_k  Σ s_n (n = 1, ..., k)                 F(x_k) ≈ (1/2) h Σ s_n
0    a         y_0                         0                                     0
1    a + h     y_1    s_1 = y_0 + y_1      s_1                                   (1/2) h s_1
2    a + 2h    y_2    s_2 = y_1 + y_2      s_1 + s_2                             (1/2) h (s_1 + s_2)
3    a + 3h    y_3    s_3 = y_2 + y_3      s_1 + s_2 + s_3                       (1/2) h (s_1 + s_2 + s_3)
4    a + 4h    y_4    s_4 = y_3 + y_4      s_1 + s_2 + s_3 + s_4                 (1/2) h (s_1 + s_2 + s_3 + s_4)
5    a + 5h    y_5    s_5 = y_4 + y_5      s_1 + s_2 + s_3 + s_4 + s_5           (1/2) h (s_1 + s_2 + s_3 + s_4 + s_5)
6    a + 6h    y_6    s_6 = y_5 + y_6      s_1 + s_2 + s_3 + s_4 + s_5 + s_6     (1/2) h (s_1 + s_2 + s_3 + s_4 + s_5 + s_6)
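The step-by-step scheme of (49) translates directly into a running sum. The sketch below uses an assumed function name `running_integral` and an assumed test integrand, cos x on (0, π/2), for which the exact integral with variable upper limit is sin x.

```python
import math

def running_integral(f, a, b, n):
    """Approximate F(x_k) = integral of f from a to x_k for k = 0, ..., n.

    Each step applies (49): F(x_{k+1}) = F(x_k) + (h/2) * (y_k + y_{k+1}).
    """
    h = (b - a) / n
    xs = [a + k * h for k in range(n + 1)]
    ys = [f(x) for x in xs]
    values = [0.0]                                   # F(a) = 0
    for k in range(n):
        values.append(values[k] + 0.5 * h * (ys[k] + ys[k + 1]))
    return xs, values

xs, values = running_integral(math.cos, 0.0, math.pi / 2, 100)
worst = max(abs(v - math.sin(x)) for x, v in zip(xs, values))
print("largest deviation from sin x:", worst)
```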


112. Graphical methods. These computations can be made graphically, if
the graph of the curve y = f(x) is given; in this case, we construct the
graph of the integral curve:

$$y = \int_{a}^{x} f(x)\,dx = F(x)$$

from the graph of the curve

$$y = f(x). \qquad (50)$$

Fig. 149

First of all, if we have enough divisions, we can take approximately

$$\frac{s_k}{2} = \frac{y_{k-1} + y_k}{2} \approx y_{k-1/2}, \qquad (51)$$

i.e. if the graph of (50) is drawn, the values $s_k/2$ can be found directly from
the figure, as the ordinates of the curve for $x_{k-1/2} = a + (2k - 1)\,h/2$ (Fig. 149).
We mark on OY the points:

$$A_1(y_{1/2}),\; A_2(y_{3/2}),\; \dots,\; A_k(y_{k-1/2}).$$

We take P to the left of O on OX, so that OP is unity. We draw:

$$PA_1,\; PA_2,\; PA_3,\; \dots,\; PA_k,$$

then draw parallels through $M_0, M_1, M_2, \dots$, so that

$$M_0M_1 \parallel PA_1, \quad M_1M_2 \parallel PA_2, \quad M_2M_3 \parallel PA_3, \quad \dots$$

Points $M_0, M_1, M_2, \dots$ are points of the required approximate integral
curve, since it can easily be observed from the figure that

$$x_1M_1 = h\,y_{1/2}, \quad x_2M_2 = h\,(y_{1/2} + y_{3/2}), \quad x_3M_3 = h\,(y_{1/2} + y_{3/2} + y_{5/2}),$$

and this shows, by the approximate equation (51), that:

$$x_kM_k = h\,(y_{1/2} + y_{3/2} + \dots + y_{k-1/2}) \approx \frac{h}{2}\,(s_1 + s_2 + \dots + s_k) \approx F(x_k)$$

by (48).

The above construction is for the case when the scale of F(x) is the same
as the scale of f(x). If the scale of the area is different, the construction is
the same except for making OP of length l instead of unity, where l is
the ratio of the scale of F(x) to that of f(x).

An approximate graphical construction of the iterated integral

$$\Phi(x) = \int_{a}^{x} dx \int_{a}^{x} f(x)\,dx$$

is based on rectangle formula (40) [108].
As before, let

$$F(x) = \int_{a}^{x} f(x)\,dx.$$

Fig. 150

Taking only the values $x_0, x_1, x_2, \dots, x_k, \dots$ of the independent variable
x, (40) gives us the approximate equations:

$$F(x_1) \approx h\,y_0, \quad F(x_2) \approx h\,(y_0 + y_1), \quad \dots, \quad F(x_k) \approx h\,(y_0 + y_1 + \dots + y_{k-1}).$$

Applying the same formula to $\Phi(x)$, we have:

$$\Phi(x_k) \approx h\left[F(x_0) + F(x_1) + \dots + F(x_{k-1})\right] \approx h^2\left[y_0 + (y_0 + y_1) + \dots + (y_0 + y_1 + \dots + y_{k-2})\right]. \qquad (52)$$

This leads us to the following construction for the ordinate $\Phi(x_k)$ (Fig.
150): taking P as before, we cut off on OY:

$$OB_1 = y_0, \quad B_1B_2 = y_1, \quad B_2B_3 = y_2, \quad \dots, \quad B_{k-1}B_k = y_{k-1}.$$

We join:

$$PB_1,\; PB_2,\; \dots,\; PB_k,\; \dots,$$

and take points:

$$M_0,\; M_1,\; M_2,\; \dots,\; M_k,\; \dots,$$

so that

$$M_1M_2 \parallel PB_1, \quad M_2M_3 \parallel PB_2, \quad M_3M_4 \parallel PB_3, \quad \dots$$

These points are points of the required approximate curve, drawn, however,
on the scale (1 : h), since it is clear from the construction that

$$x_2M_2 = h\,y_0, \quad x_3M_3 = h\,y_0 + h\,(y_0 + y_1), \quad x_4M_4 = h\,y_0 + h\,(y_0 + y_1) + h\,(y_0 + y_1 + y_2), \quad \dots$$

by (52). If PO is l instead of unity, the curve drawn gives the ordinate of the
curve $\Phi(x)$, changed in the ratio 1 : lh.

It should be mentioned that the accuracy of the above constructions is
not great, for all their convenience, and they are only useful for fairly rough
calculations.


113. Areas under rapidly oscillating curves. We mentioned in [110] above
that the successful application of the various approximate formulae for evaluating
definite integrals depends on dividing the curve, the area under which
is to be found, into sections in which it has a smooth shape.

This is a very awkward requirement in the case of curves that behave
irregularly, with frequent oscillations. If the area under such a curve is
to be found in accordance with the above rule, too many subdivisions have
to be introduced, and the computation becomes very involved.
A second method is useful in these cases, and consists in dividing the area
into strips parallel to OX, instead of OY. To find the approximate area under
the curve of Fig. 151, we mark off on OY the least and greatest ordinates
$\alpha$ and $\beta$ of the curve, and divide the interval $(\alpha, \beta)$ into n parts with the points:

$$y_0 = \alpha,\; y_1,\; y_2,\; \dots,\; y_{i-1},\; y_i,\; \dots,\; y_{n-1},\; y_n = \beta.$$

We draw the parallels to OX through these points, thus dividing the total
area into strips, made up of separate pieces; an approximate expression for
the area of the ith strip can be found by taking the product of its base
$(y_i - y_{i-1})$ and the sum of the lengths of the segments cut off on any line

$$y = \eta_i \qquad (y_{i-1} \le \eta_i \le y_i)$$

contained within the area in question; this sum can be found directly from
the figure. If $l_i$ denotes the sum, we get an approximate expression for the
required area S of the form:

$$y_0\,(b - a) + (y_1 - y_0)\,l_1 + (y_2 - y_1)\,l_2 + \dots + (y_n - y_{n-1})\,l_n,$$

which increases in accuracy, the greater the number of divisions and the
sharper the oscillations of the curve.
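A rough numerical imitation of the horizontal-strip method is given below. It is a sketch under stated assumptions: the lengths $l_i$, which the text reads off the figure, are here estimated by sampling the curve at many abscissae, and the function name `horizontal_strip_area`, the sample counts and the test curve 2 + sin 20x on (0, π) are all illustrative choices.

```python
import math

def horizontal_strip_area(f, a, b, n_levels=200, n_samples=20000):
    """Approximate the area under y = f(x) by strips parallel to OX."""
    dx = (b - a) / n_samples
    ys = [f(a + (j + 0.5) * dx) for j in range(n_samples)]
    alpha, beta = min(ys), max(ys)          # least and greatest ordinates
    dy = (beta - alpha) / n_levels          # base (y_i - y_{i-1}) of each strip
    area = alpha * (b - a)                  # the term y_0 * (b - a)
    for i in range(1, n_levels + 1):
        eta = alpha + (i - 0.5) * dy        # a line y = eta_i inside the ith strip
        # l_i: total length of the segments of this line lying under the curve,
        # estimated as dx times the number of samples with f(x) >= eta_i.
        l_i = dx * sum(1 for y in ys if y >= eta)
        area += dy * l_i
    return area

def wiggly(x):
    return 2.0 + math.sin(20.0 * x)         # assumed rapidly oscillating curve

estimate = horizontal_strip_area(wiggly, 0.0, math.pi)
print(estimate, 2.0 * math.pi)              # the exact area is 2*pi
```

The number of levels controls the accuracy independently of how often the curve oscillates, which is the point of the method.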

Suitable development of the basic idea of this method leads to the concept
of Lebesgue integrals, which is far more general than the concept described
above of Riemann integrals [94, 116].


§ 11. Further remarks on definite integrals 


114. Preliminary concepts. The last paragraphs of the present chapter
are devoted to the rigorous analytic consideration of the concept of
integral, and in fact we shall prove below that a sum of the form

$$\sum_{k=1}^{n} f(\xi_k)\,(x_k - x_{k-1})$$

has a definite limit not only in the case of a continuous function. To do
this, we have to introduce some new concepts regarding discontinuous
functions. Let f(x) be defined in a certain finite interval (a, b).
We shall consider only bounded functions, i.e. those with absolute
values less than a given positive number throughout the interval. To
be precise, a function f(x) is said to be bounded in the interval (a, b),
if there exists a positive number M, such that we have for all x of the
interval:

$$|f(x)| < M.$$

If f(x) is continuous, it attains a greatest and least value in the interval,
as already remarked [35], and thus it is obviously also bounded.
Discontinuous functions, on the other hand, can be unbounded as well
as bounded. We shall only consider bounded discontinuous functions
in future. We suppose, for instance, that f(x) has a graph as shown in
Fig. 152. We have a discontinuity at x = c, and the value of the
function at this point, i.e. f(c), has to be defined by means of some
supplementary condition. The function is continuous at the remaining
points of the interval, including the ends a and b. Also, as x tends to c
from smaller values, i.e. from the left, the ordinate f(x) tends to a
definite limit, denoted geometrically by $NM_1$. Similarly, if x tends to
c from larger values, i.e. from the right, f(x) again tends to a definite
limit, denoted by $NM_2$, this latter limit being different from the
previous, left-hand limit. The left-hand limit is usually denoted by
f(c − 0), and the right-hand limit by f(c + 0) [32]. The above very
simple type of discontinuity, where a finite determinate limit exists
both on the left and on the right, is usually called a discontinuity of the
first kind. The value of the function at the point itself, x = c, i.e.
f(c), will in general differ both from f(c − 0) and f(c + 0), and requires
supplementary definition. If a function is continuous in an interval
(a, b) including the ends, with the exception of a finite number of
points where it has discontinuities of the first kind, the graph consists
of a finite number of curves that are continuous as far as their
ends, together with distinct points at the places where the discontinuities
occur (Fig. 153). Such a function is evidently bounded throughout
the interval, in spite of its discontinuities. At the same time, of course,
functions with more complicated discontinuities can also be bounded.

Fig. 152

Fig. 153

We shall often be considering, further on, the set of all the values
taken by a function f(x) in a given interval of variation of the independent
variable. If the function is bounded in the interval, the set
of its values in the interval is bounded above and below, and hence
the set has a strict upper and a strict lower bound [39]. If, for instance,
f(x) is continuous in the (closed) interval, it attains a greatest and a
least value in the interval, as we know from [35]. In this case, the
greatest and least values of the function are also the strict upper
and lower bounds of the values of f(x) in the interval. We take another
example. If f(x) is an increasing function, it takes its greatest value
at the right-hand end of the interval, and its least value at the left.
As in the previous case, these values are the strict upper and lower
bounds of the values of f(x). In both examples, the strict bounds of
the values of the function are particular values of the function, i.e.
they themselves belong to the set of values of the function. In the
more complex case of discontinuous functions, the strict bounds
of the values of the function may not themselves be values of the
function, i.e. they may not belong to the set of values of the function.

Let M and m be the strict upper and lower bounds of the values
of f(x) in the interval (c, d), i.e. for $c \le x \le d$, where obviously,
$m \le M$. We take a new interval (c', d'), which is only part of (c, d).
Let M' and m' be the strict upper and lower bounds of the values
of f(x) in the new interval (c', d'). Since the set of all the values of
f(x) in (c', d') must always be included among the values of f(x) in the
wider interval (c, d), we can say that $M' \le M$ and $m' \ge m$, i.e. if the
interval of variation of x is replaced by a part of the interval, the strict
upper bound of the values of function f(x) cannot increase, and the strict
lower bound cannot decrease. This fact has great importance for us
later on.


115. Darboux's theorem. Let f(x) be bounded in the interval (a, b),
and let m and M be the strict lower and upper bounds of its values
in the interval. We divide the interval with points corresponding
to the values of x:

$$a = x_0 < x_1 < x_2 < \dots < x_{k-1} < x_k < \dots < x_{n-1} < x_n = b,$$

and introduce the lengths of the sub-intervals $\delta_k = x_k - x_{k-1}$
(k = 1, 2, ..., n). Let $x = \xi_k$ belong to the sub-interval $(x_{k-1}, x_k)$
(k = 1, 2, ..., n). We form the sum of the products:

$$\sum_{k=1}^{n} f(\xi_k)\,\delta_k. \qquad (1)$$


The value of this sum depends, firstly, on the method of division
of the interval (a, b), and secondly, on the choice of $x = \xi_k$ in each
sub-interval. Our problem is to investigate the limit of the sum when
the number n of sub-intervals increases indefinitely and the greatest
of the lengths $\delta_k$ tends to zero. We have to decide in what cases it is
possible to speak of this limit, i.e. decide for what functions f(x)
the sum (1) tends to a definite limit, independently of the method of
division of the interval and of the choice of $\xi_k$.

We consider f(x) in each sub-interval, and let $M_k$ and $m_k$ be the
strict upper and lower bounds of its values in $(x_{k-1}, x_k)$. We replace
$f(\xi_k)$ in the terms of sum (1) by $M_k$ or $m_k$.

We thus arrive at the following two sums:

$$S = \sum_{k=1}^{n} M_k\,\delta_k, \qquad (2)$$

$$s = \sum_{k=1}^{n} m_k\,\delta_k, \qquad (3)$$


with the inequality following directly from the definition of strict
bounds:

$$m_k \le f(\xi_k) \le M_k,$$

whence we have, since $\delta_k$ is positive:

$$s \le \sum_{k=1}^{n} f(\xi_k)\,\delta_k \le S. \qquad (4)$$
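
Inequality (4) can be checked numerically. The Python sketch below is an illustration added here and is not part of the original text. It assumes an increasing function, f(x) = x² on (0, 1), so that the strict bounds on each sub-interval are simply the values at its left and right ends; the names `darboux_and_riemann`, `square`, `points` and `sigma` are hypothetical.

```python
import random


def darboux_and_riemann(f, points, rng):
    """Return (s, sigma, S) for an increasing f on the division `points`.

    s and S are the sums (3) and (2); sigma is the sum (1) with each
    xi_k drawn at random from its sub-interval.
    """
    s = sigma = S = 0.0
    for left, right in zip(points, points[1:]):
        delta = right - left              # delta_k = x_k - x_{k-1}
        xi = rng.uniform(left, right)     # arbitrary point of the sub-interval
        s += f(left) * delta              # m_k = f(x_{k-1}) for increasing f
        S += f(right) * delta             # M_k = f(x_k) for increasing f
        sigma += f(xi) * delta
    return s, sigma, S


def square(x):
    return x * x


points = [k / 10 for k in range(11)]      # a = 0, b = 1, n = 10
s, sigma, S = darboux_and_riemann(square, points, random.Random(0))
print(s, sigma, S)
```

Whatever seed is passed to `random.Random`, the middle value should lie between the other two, since each term of (1) lies between the corresponding terms of (3) and (2).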


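We consider sums S and s in more detail, then return to the more
general sum (1). In accordance with the remark of the previous article,
$M_k$ and $m_k$ always satisfy the inequality

$$m \le m_k \le M_k \le M,$$

and in addition, obviously:

$$\sum_{k=1}^{n} \delta_k = \sum_{k=1}^{n} (x_k - x_{k-1}) = b - a.$$

It follows directly that

$$m\delta_k \le M_k\delta_k \le M\delta_k \quad\text{and}\quad m\delta_k \le m_k\delta_k \le M\delta_k,$$

whence we get by summing over k:

$$m(b - a) \le \sum_{k=1}^{n} M_k\delta_k \le M(b - a) \quad\text{and}\quad m(b - a) \le \sum_{k=1}^{n} m_k\delta_k \le M(b - a),$$

i.e. sums S and s always lie between the bounds m(b − a) and
M(b − a) for any division of the interval.

If we take all the possible divisions of the interval (a, b), we get
infinite sets of values for both sums (2) and (3). It follows from what
has just been said that both these sets are bounded, and hence, both
have strict upper and lower bounds.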
infinite sets of values for both sums (2) and (3). It follows from what 
has just been said that both these sets are bounded, and hence, both 
have strict upper and lower bounds. 

We examine the sum S more closely, and assume for the present
that all the values of f(x) are positive. In this case, all the terms of S
are also positive. We suppose that the interval (a, b) is divided into
sub-intervals $\delta_k$ in a definite way, so that S has a definite value.

We make a further division of the sub-intervals $\delta_k$.† For instance,
let a given $\delta_k$ be divided into three parts: $\delta_k^{(1)}$, $\delta_k^{(2)}$, $\delta_k^{(3)}$, and let the
corresponding strict upper bounds of f(x) be $M_k^{(1)}$, $M_k^{(2)}$ and $M_k^{(3)}$.

By the remark of the previous article, these strict upper bounds are
never greater than the strict upper bound for the full sub-interval
$\delta_k$, i.e.

$$M_k^{(1)},\ M_k^{(2)}\ \text{and}\ M_k^{(3)} \le M_k, \qquad (5)$$

and in addition, obviously:

$$\delta_k^{(1)} + \delta_k^{(2)} + \delta_k^{(3)} = \delta_k. \qquad (6)$$

After the further division of $\delta_k$ into three parts, the term $M_k\delta_k$
of S is replaced by three terms:

$$M_k^{(1)}\delta_k^{(1)},\quad M_k^{(2)}\delta_k^{(2)}\quad\text{and}\quad M_k^{(3)}\delta_k^{(3)},$$

and we have by (5) and (6):

$$M_k^{(1)}\delta_k^{(1)} + M_k^{(2)}\delta_k^{(2)} + M_k^{(3)}\delta_k^{(3)} \le M_k\delta_k, \qquad (7)$$

i.e. if we start from a definite division of the interval, then further
sub-divide individual sub-intervals $\delta_k$, the sum S can only diminish,
or more precisely, cannot increase. Instead of considering as a
whole the new sum that we get by sub-dividing individual sub-
intervals $\delta_k$, we shall later be considering only part of its terms,
i.e. we shall be discarding certain terms of the new sum. Since all
the terms are positive, the discarding of certain terms can only de-
crease the value of the complete sum, i.e. the new sum will become
a fortiori not greater than the original sum S, which we had before
sub-division of the sub-intervals $\delta_k$.
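
The effect of sub-division on S can be seen in a small computation. The following Python sketch is an added illustration under stated assumptions, not the book's own material: it takes the positive increasing function f(x) = x² + 1 on (0, 2), for which the strict upper bound on a sub-interval is the value at its right end, and divides every sub-interval into three equal parts. The helper names `upper_sum` and `split_in_three` are hypothetical.

```python
def upper_sum(f, points):
    """Sum (2) for an increasing f: M_k is the value at the right end."""
    return sum(f(right) * (right - left)
               for left, right in zip(points, points[1:]))


def split_in_three(points):
    """Divide every sub-interval into three equal parts, keeping old points."""
    refined = []
    for left, right in zip(points, points[1:]):
        step = (right - left) / 3
        refined += [left, left + step, left + 2 * step]
    refined.append(points[-1])
    return refined


def f(x):
    return x * x + 1                      # positive and increasing on (0, 2)


coarse = [0.0, 0.5, 2.0]
fine = split_in_three(coarse)
S_coarse = upper_sum(f, coarse)
S_fine = upper_sum(f, fine)
print(S_coarse, S_fine)
```

Because all the points of the first division are preserved in the second, the second value printed should not exceed the first, in agreement with (7).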

We have compared the two values of S corresponding to two 
types of division of the interval (a, b), the second type of division 
being obtained by further subdivision of the first, in such a way that 
all the points of division of the first were preserved in the second. 


† For brevity, we use $\delta_k$ to denote both the sub-interval and its length.

When the two values of S are compared for two arbitrary methods
of division of (a, b), there will in general be no simple relationship
between them. But it is clear that, if the lengths of sub-intervals $\delta_k$
are sufficiently small for both methods of division, the values of S will
be close together in magnitude. More precisely, it can be shown that,
on indefinite increase in the number n of divisions, and on indefinite
decrease of the greatest of the increments $\delta_k$, S tends to a definite
limit, independently of the method of division of the interval (a, b).

We now proceed to the proof of this useful proposition. 

We take all the possible values of S, obtained by all the possible
divisions of (a, b). Let L be the strict lower bound of this set of
values of S, which is bounded above and below. We show that L
is the above-mentioned limit of S.

By the definition of strict lower bound, we have $L \le S$ for all S.
To prove our assertion that L is the limit of S, we have to show that,
for any given positive $\varepsilon$, there exists an $\eta$ such that all S are less than
$(L + \varepsilon)$ for all $\delta_k$ less than $\eta$.

It follows from the definition of L as the strict lower bound of values
of S, that there exists a fully defined method (I) of division of (a, b)
into sub-intervals $\delta'_k$, such that the corresponding value of S, say S',
is less than $(L + \varepsilon/2)$. Let p be the number of points of division of the
total interval (a, b) according to method (I). We take any other method
(II) of division of (a, b), and we let $\delta_k$ be the lengths of the sub-intervals
$(x_{k-1}, x_k)$ as usual. We separate all the increments $\delta_k$ into two classes.

We put those that are wholly contained in one of the sub-intervals $\delta'_k$
in the first class, $\delta'_k$ being obtained by method (I), and we put those that
run into more than one $\delta'_k$ in the second class. Let $\sigma_l$ be the lengths
of the sub-intervals of the first class, and $\tau_m$ the lengths of those of
the second class. Further, let $\mu_l$ and $\nu_m$ be the strict upper bounds
of f(x) in the sub-intervals $\sigma_l$ and $\tau_m$. Subscripts l and m run through
certain integers that are of no interest to us, and in future, when summ-
ing over these subscripts, we shall not indicate the limits of summation,
it being assumed that summation takes place over all the sub-intervals
of the first or second class. When separating the $\delta_k$ into two classes,
we can at the same time separate the total sum S obtained by method
(II) of division into two sums:

$$S = S_1 + S_2,$$

where

$$S_1 = \sum \mu_l\,\sigma_l, \qquad S_2 = \sum \nu_m\,\tau_m.$$




Each $\sigma_l$ forms part of a $\delta'_k$ of the first method (I) of division, whereas
the $\sigma_l$ taken together need not fill up all the $\delta'_k$, i.e. $S_1$ can be found
from S' by sub-dividing the $\delta'_k$ and discarding certain terms. We can
therefore say, by what was proved above, that $S_1$ is not greater than S',
i.e. we can write, since $S' < L + \varepsilon/2$:

$$S_1 < L + \frac{\varepsilon}{2}. \qquad (7_1)$$


We now consider the second sum $S_2$. The sub-intervals $\tau_m$ run into
more than one (at least two) $\delta'_k$ of method of division (I), i.e. $\tau_m$
covers at least one point of division of (a, b) in accordance with method
(I), so that the number of terms in $S_2$ is not greater than the number p
of points of division by method (I), where p is a definite positive integer.
The factors $\nu_m$ do not exceed the strict upper bound M of f(x) in the
whole of (a, b). If $\tau$ denotes the greatest of the $\tau_m$, every term of $S_2$ is
not greater than $M\tau$, and hence, we have for the whole sum:

$$S_2 \le M\tau p. \qquad (8)$$


We now take $\eta$ equal to $\varepsilon/(2Mp)$ and show that it satisfies the condition
stated above. We therefore suppose that all the $\delta_k$ of method (II) of
division satisfy the inequality:

$$\delta_k < \frac{\varepsilon}{2Mp}. \qquad (9)$$

Since the $\tau_m$ are certain of the $\delta_k$, we have $\tau_m < \varepsilon/(2Mp)$, i.e.

$$\tau < \frac{\varepsilon}{2Mp},$$

and (8) gives us for $S_2$:

$$S_2 < \frac{\varepsilon}{2}. \qquad (10)$$

On adding inequalities $(7_1)$ and (10), we get for the total sum S:

$$S < L + \frac{\varepsilon}{2} + \frac{\varepsilon}{2} = L + \varepsilon.$$

We thus have the inequality for the sum S:

$$L \le S < L + \varepsilon,$$

for any method of division of the interval (a, b), provided only
that the lengths of the sub-intervals satisfy inequality (9).

In view of the arbitrary smallness of the given positive $\varepsilon$, we con-
clude from this that L is in fact the limit of sum S.

We have assumed in the above discussion that all the values of f(x)
are positive. If this is not the case, we can always add a positive con-
stant A to the bounded function f(x), such that the new function
$\varphi(x) = f(x) + A$ is positive. Our proposition can now be considered
proved for the new function $\varphi(x)$, i.e. S has a definite limit for the
new function. Since the strict upper bound of $\varphi(x)$ is evidently
$M_k + A$ in $(x_{k-1}, x_k)$, using the previous notation, we see that the
sum for $\varphi(x)$ has the form:

$$\sum_{k=1}^{n} (M_k + A)\,\delta_k = \sum_{k=1}^{n} M_k\delta_k + A\sum_{k=1}^{n} \delta_k = S + A(b - a),$$

where, as above, $M_k$ is the strict upper bound of f(x) in $\delta_k$.
We now consider the formula obtained:

$$\sum_{k=1}^{n} (M_k + A)\,\delta_k = \sum_{k=1}^{n} M_k\delta_k + A(b - a).$$

The left-hand sum has a definite limit, as remarked above, equal
to the strict lower bound of the values of the left-hand sum.
We have two terms on the right-hand side, one of which, A(b − a),
is a definite number, so that we can say that the other term $\sum_{k=1}^{n} M_k\delta_k$
also has a definite limit, equal to the strict lower bound of the set of
values of this sum.

We have thus shown that the sum S has a determinate limit L for
any bounded function f(x) in a finite interval. It can similarly be
shown that the sum (3) also tends to a determinate limit l on indefinite
decrease of the greatest of the $\delta_k$. This number l is the strict upper
bound of all the possible values of the sum s for all the possible divi-
sions of interval (a, b). Moreover, noting that $m_k \le M_k$, and comparing
expressions (2) and (3) for sums S and s, we see that, given the same
method of division, we always have $s \le S$. We therefore get the same
inequality for the limits, i.e. $l \le L$. We formulate the result obtained
as the following theorem, due to the French mathematician Darboux:

DARBOUX'S THEOREM. The sums s and S tend to definite limits l and
L, where $l \le L$, for any function bounded in the interval (a, b), on inde-
finite increase in the number of sub-divisions and indefinite decrease of
the greatest of the $\delta_k$.
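
The behaviour described by the theorem can be observed numerically. The Python sketch below is an added illustration, not part of the original text. It assumes the increasing function f(x) = x³ on (0, 1), whose strict bounds on a sub-interval are its end values and for which both limits equal 1/4; s and S are deliberately formed on two different random divisions. The names `lower_upper`, `random_division` and `results` are hypothetical.

```python
import random


def lower_upper(f, points):
    """Sums (3) and (2) for an increasing f on the division `points`."""
    s = sum(f(l) * (r - l) for l, r in zip(points, points[1:]))
    S = sum(f(r) * (r - l) for l, r in zip(points, points[1:]))
    return s, S


def random_division(a, b, n, rng):
    """A division of (a, b) by n - 1 random interior points."""
    inner = sorted(rng.uniform(a, b) for _ in range(n - 1))
    return [a] + inner + [b]


def cube(x):
    return x ** 3


rng = random.Random(1)
results = []
for n in (10, 100, 1000, 10000):
    # s and S come from two independent divisions of (0, 1).
    s, _ = lower_upper(cube, random_division(0.0, 1.0, n, rng))
    _, S = lower_upper(cube, random_division(0.0, 1.0, n, rng))
    results.append((n, s, S))
    print(n, s, S)
```

In every row s should stay below 1/4 and S above it, and the two should draw together as the sub-intervals shrink.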

We said above that l is the strict upper bound of values of s and L the
strict lower bound of values of S. Having proved that $l \le L$, we can there-
fore say that $s \le S$, whatever the methods of division used for forming
s and S.


116. Functions integrable in Riemann's sense. If we now turn to
the general sum:

$$\sum_{k=1}^{n} f(\xi_k)\,\delta_k \qquad (x_{k-1} \le \xi_k \le x_k), \qquad (11)$$

it seems that we cannot yet assert the existence of a limit in the case
of any bounded function f(x).

There is some uncertainty about the magnitude of the factor $f(\xi_k)$,
since $\xi_k$ can be chosen in any manner from the sub-interval $(x_{k-1}, x_k)$.
It is due to this fact that the sum (11) does not always have a definite
limit. We can suppose, for instance, that the limits l and L, referred
to in Darboux's theorem, are not identical, i.e. that l < L. By the
definition of strict upper and lower bounds, we can choose $\xi_k$, on the
one hand, so that $f(\xi_k)$ is as close as desired to $m_k$, and on the other,
so that $f(\xi_k)$ is as close as desired to $M_k$. The sum (11) will be as close
as desired to the value of the corresponding sum s in the first case, and
to that of S in the second case. We can thus, by suitable choice of $\xi_k$,
make (11) as close as desired either to l (the limit of s) or to L (the
limit of S), on indefinite decrease of the $\delta_k$. Since l differs from L by
hypothesis, it is evident that (11) has no definite limit, on indefinite
increase of n and indefinite decrease of the greatest of the $\delta_k$. We
have thus shown that (11) has no definite limit, if l < L.
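
A standard illustration of the case l < L, not taken from the text above, is the function equal to 1 at rational points and 0 at irrational points: on every sub-interval its strict bounds are 0 and 1, so that l = 0 and L = b − a. The Python sketch below models this under a loudly stated assumption: a `Fraction` stands for a rational point and a `float` is used merely as a label for an irrational one. All names in it are hypothetical.

```python
from fractions import Fraction
import math


def dirichlet(x):
    """1 at rational points and 0 at irrational points.

    Modelling assumption: a Fraction stands for a rational point, and a
    float stands for an irrational one (used here only as a label).
    """
    return 1 if isinstance(x, Fraction) else 0


def riemann_sum(f, points, tags):
    """Sum (11) for the division `points` and chosen points `tags`."""
    return sum(f(t) * (right - left)
               for t, left, right in zip(tags, points, points[1:]))


n = 1000
points = [Fraction(k, n) for k in range(n + 1)]
rational_tags = [(l + r) / 2 for l, r in zip(points, points[1:])]
irrational_tags = [float(l) + float(r - l) / math.sqrt(2)
                   for l, r in zip(points, points[1:])]

print(riemann_sum(dirichlet, points, rational_tags))    # every f(xi_k) = 1
print(riemann_sum(dirichlet, points, irrational_tags))  # every f(xi_k) = 0
```

However large n is taken, the first choice of points gives the value of S and the second the value of s, so the sum has no single limit.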

We now show that, if l = L, (11) has a definite limit, equal to l = L.
We have, in fact, by the definition of strict upper and lower bound,
$m_k \le f(\xi_k) \le M_k$, and we can therefore write:

$$\sum_{k=1}^{n} m_k\delta_k \le \sum_{k=1}^{n} f(\xi_k)\,\delta_k \le \sum_{k=1}^{n} M_k\delta_k.$$

The extreme terms of this inequality have the common limit l = L,
on indefinite decrease of the greatest of the $\delta_k$, and hence the sum
$\sum_{k=1}^{n} f(\xi_k)\,\delta_k$ must tend to the same limit, for any choice of $\xi_k$. As we
know, the limit of this sum is referred to as the definite integral of
f(x) over (a, b), and if this limit exists, the function is said to be
integrable in Riemann's sense, or simply, integrable. Definitions
different from the above are given to the definite integral in certain cases,
and of course, the condition for integrability is then different. We speak
of integrability in Riemann's sense (Riemann was a mid-nineteenth
century German mathematician) so as to distinguish his method of
forming the definite integral from other methods. Since we shall only
be concerned in future with Riemann integrals, we shall not enlarge
on this remark, and functions integrable in Riemann's sense will
simply be referred to as integrable.

It follows from the above, that a necessary and sufficient condition
for the integrability of f(x) consists in the coincidence of the limits l and
L of sums s and S, i.e. in the fact that the difference between these sums:

$$\sum_{k=1}^{n} (M_k - m_k)\,\delta_k \qquad (12)$$

tends to zero on indefinite increase of n and on indefinite decrease of the
greatest of the increments $\delta_k$. We investigate some classes of function,
for which the above condition is satisfied, i.e. some classes of
integrable function.

The sum (12) consists of non-negative terms, and its magnitude is
not less than L − l, since L is the strict lower bound of sum (2),
and l the strict upper bound of (3). We can say from this, and from
Darboux's theorem, that a necessary and sufficient condition for
integrability, i.e. for coincidence of l and L, may be expressed as
follows: for any given positive $\varepsilon$, there exists a division of the interval
(a, b), such that the sum (12) is less than $\varepsilon$.


1. If f(x) is continuous in (a, b) (including the ends), it is uniformly contin-
uous in the interval. Furthermore, it attains a least value $m_i$ and a greatest
value $M_i$ in each sub-interval $\delta_i$. By the uniform continuity of f(x), the
non-negative differences $M_i - m_i$ will be less than any positive $\varepsilon$ on indefinite
decrease of the greatest of the increments $\delta_i$, and we shall have for the total
sum (12):

$$0 \le \sum_{i=1}^{n} (M_i - m_i)\,\delta_i < \sum_{i=1}^{n} \varepsilon\,\delta_i = \varepsilon(b - a).$$

Hence it follows, by the arbitrary smallness of $\varepsilon$, that (12) tends to zero,
i.e. every continuous function is integrable.

2. We now take f(x) bounded, with a finite number of discontinuities.
We assume, for clarity, that it has one discontinuity at x = c inside (a, b).

[Fig. 154: the points a, $a_1$, c, $b_1$, b marked in order on a line.]

The case of any finite number of discontinuities can be considered in exactly
the same way. Since the limit L − l of (12) does not depend on the method
of division of (a, b), we can use whatever method of division we like for our dis-
cussion, with the only proviso that all the increments $\delta_i$ tend to zero. We isolate
the point c from (a, b) by means of a small sub-interval $(a_1, b_1)$ (Fig. 154) such
that c is an interior point. The sub-interval is defined more accurately later.
We have $|f(x)| \le N$, since f(x) is bounded, i.e. we have all $M_i \le N$ and
all $m_i \ge -N$, so that:

$$0 \le M_i - m_i \le 2N. \qquad (13)$$

Let $\varepsilon$ be any small given positive number. We choose $(a_1, b_1)$ such that

$$2N(b_1 - a_1) < \varepsilon. \qquad (14)$$

We shall assume that points $a_1$, $b_1$ form points of division of (a, b). The
sum (12) is now divided into three parts: $S_1$, corresponding to the interval
$(a, a_1)$; $S_2$, corresponding to $(b_1, b)$; and $S_3$, corresponding to $(a_1, b_1)$.

Since f(x) is uniformly continuous in $(a, a_1)$, $S_1$ tends to zero, as in 1.
Similarly for $S_2$, i.e. $S_1$ and $S_2$ are less than $\varepsilon$ for all sufficiently small $\delta_i$.

To obtain $S_3$ we have to perform the summation in (12) over those of
the $\delta_i$ that belong to $(a_1, b_1)$, the sum of these $\delta_i$ being evidently equal to
$(b_1 - a_1)$. Using (13) also, we have:

$$0 \le S_3 \le \sum{}' 2N\delta_i = 2N\sum{}' \delta_i = 2N(b_1 - a_1),$$

where the summation is over the above-mentioned $\delta_i$. By (14), we have $S_3 < \varepsilon$,
and the total sum (12) is less than $3\varepsilon$. Hence we can conclude, since
$\varepsilon$ is arbitrarily small, that the sum tends to zero, i.e. every bounded function
with a finite number of discontinuities is integrable. We had such a function
in the first example of [97].

3. We take the case of f(x) monotonic and bounded in (a, b). We assume
for clarity that the function is non-decreasing, i.e. if $c_1 < c_2$, $f(c_1) \le f(c_2)$.
We now have in each $\delta_i$: $M_i = f(x_i)$ and $m_i = f(x_{i-1})$. The sum (12) will
be:

$$\sum_{i=1}^{n} [f(x_i) - f(x_{i-1})]\,(x_i - x_{i-1}).$$

Let $\lambda$ denote the greatest of the $(x_i - x_{i-1})$. By hypothesis, $\lambda \to 0$.
Since $f(x_i) - f(x_{i-1}) \ge 0$, we can write:

$$0 \le \sum_{i=1}^{n} [f(x_i) - f(x_{i-1})]\,(x_i - x_{i-1}) \le \lambda \sum_{i=1}^{n} [f(x_i) - f(x_{i-1})],$$

i.e.

$$0 \le \sum_{i=1}^{n} [f(x_i) - f(x_{i-1})]\,(x_i - x_{i-1}) \le \lambda\,[f(b) - f(a)],$$

since obviously:

$$\sum_{i=1}^{n} [f(x_i) - f(x_{i-1})] = [f(x_1) - f(a)] + [f(x_2) - f(x_1)] + \dots + [f(b) - f(x_{n-1})] = f(b) - f(a).$$

We thus see at once that (12) tends to zero, i.e. every monotonic bounded
function is integrable. We remark that a monotonic function can have an
infinite set of discontinuities, so that case (III) is not dealt with by case (II).
We can take as an example the function equal to zero for $0 \le x < 1/2$,
equal to 1/2 for $1/2 \le x < 2/3$, equal to 2/3 for $2/3 \le x < 3/4$, and so on, and
finally equal to 1 for x = 1.

The discontinuities of this non-decreasing function are at:

$$x = \frac{1}{2},\ \frac{2}{3},\ \frac{3}{4},\ \dots,\ \frac{n}{n+1},\ \dots$$


We remark that every monotonic bounded function must have limits 
f(c — 0) and f(c + 0) at every discontinuity x = c. This follows directly from 
the existence of a limit for a monotonic bounded sequence [30]. 
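
The bound obtained for monotonic functions can be verified exactly on the step function of the example. The following Python sketch is an added illustration using exact rational arithmetic; the closed form `floor(x / (1 - x))` for the step index, the non-uniform division, and the names `staircase`, `oscillation_sum`, `lam` and `bound` are all assumptions made for this illustration only.

```python
from fractions import Fraction
import math


def staircase(x):
    """The example function: n/(n+1) on [n/(n+1), (n+1)/(n+2)), and 1 at x = 1."""
    if x == 1:
        return Fraction(1)
    n = math.floor(x / (1 - x))           # largest n with n/(n+1) <= x
    return Fraction(n, n + 1)


def oscillation_sum(f, points):
    """Sum (12) for a non-decreasing f: M_i - m_i = f(x_i) - f(x_{i-1})."""
    return sum((f(r) - f(l)) * (r - l) for l, r in zip(points, points[1:]))


# A non-uniform division of (0, 1): x_i = (i/20)^2.
points = [Fraction(i * i, 400) for i in range(21)]
lam = max(r - l for l, r in zip(points, points[1:]))
total = oscillation_sum(staircase, points)
bound = lam * (staircase(points[-1]) - staircase(points[0]))
print(total, bound)
```

The first number printed should not exceed the second, which is the greatest sub-interval length multiplied by f(b) − f(a), even though the function has infinitely many discontinuities.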

We have found the condition for integrability on the assumption, throughout,
that f(x) is bounded. It can be shown that this is a necessary condition
for integrability, i.e. for the existence of a definite limit of the sum (11).
If the condition of boundedness is not satisfied, the integral of f(x) over the 
interval (a, b) can still be defined in certain cases, though no longer as the 
limit of (11). The integral is then said to be improper. The basic principles of 
improper integrals were explained in [97]. They are described in more detail 
in Volume II. 

If integration is over an interval that is infinite at one end or the other, 
the definite integral can again not be conceived directly as the limit of 
a sum of the form (11), and we again have an improper integral (cf. [98] 
of Volume II). 


117. Properties of integrable functions. The basic properties of 
integrable functions are easily obtained by using the necessary and 
sufficient condition for integrability given above. 

I. If f(x) is integrable in (a, b), and we arbitrarily change the value
of f(x) at a finite number of points of (a, b), the new function will also
be integrable in (a, b), and its integral will not differ from that of f(x).

We confine ourselves to the case when f(x) is changed at one point,
say x = a. The new function $\varphi(x)$ coincides with f(x) everywhere
except at x = a, $\varphi(a)$ being taken arbitrarily. Let m and M be the
strict lower and upper bounds of f(x) in (a, b). The strict lower bound
of $\varphi(x)$ will evidently be greater than or equal to m, if $\varphi(a) \ge m$,
and will be $\varphi(a)$ if $\varphi(a) < m$. Similarly, the strict upper bound of
$\varphi(x)$ will be less than or equal to M, if $\varphi(a) \le M$, and will be $\varphi(a)$
if $\varphi(a) > M$. Comparing the sum (12) for f(x) and $\varphi(x)$, we note that
there can only be a difference in the first term (for k = 1). But this
first term evidently tends to zero for f(x) and $\varphi(x)$, since $\delta_1 \to 0$
and $(M_1 - m_1)$ is bounded. The sum of the remaining terms, excluding
the first, evidently also tends to zero, since f(x) is integrable, and the
entire sum (12) must tend to zero in the case of f(x). The integrability
of $\varphi(x)$ is proved. The identity of the integrals of f(x) and $\varphi(x)$ is
evident, since we can always take $\xi_k$ different from a when forming
the sum (11), and f(x) and $\varphi(x)$ coincide at all points except x = a.

II. If f(x) is integrable in (a, b), it is integrable in any (c, d) forming
part of (a, b). 

We can find the limits l and L of sums s and S on the assumption 
that c and d are among the points of division of (a, b). The sum 
(12) for (c, d) is now obtained from the sum (12) for (a, b) simply by sub- 
tracting the terms corresponding to (a,c) and (d, b). Since the terms 
are non-negative, (12) for (c, d) is less than or equal to (12) for (a, b), 
so that if the latter sum tends to zero [f(x) integrable in (a, b)], the 
former will certainly also tend to zero, i.e. f(x) is integrable in (c, d). 
We remark that c can coincide with a, whilst d can coincide with b. 
The equality is proved as in [94]:

$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx \qquad (a < c < b).$$

III. If f(x) is integrable in (a, b), cf(x), where c is any constant, is 
also integrable in (a, b). 

Taking c > 0, for instance, we see that the previous $m_k$ and $M_k$ have
to be replaced by $cm_k$ and $cM_k$ for cf(x). The sum (12) only acquires
the factor c and will tend to zero as before. Property V of [94] is
evidently preserved and is proved as before.

IV. If $f_1(x)$ and $f_2(x)$ are integrable in (a, b), their sum
$\varphi(x) = f_1(x) + f_2(x)$ is also integrable in (a, b).

Let $m_k'$, $M_k'$, $m_k''$, $M_k''$ be the strict lower and upper bounds of
$f_1(x)$ and $f_2(x)$ in the sub-interval $(x_{k-1}, x_k)$. Then all the values
of $f_1(x)$ in $(x_{k-1}, x_k)$ are greater than or equal to $m_k'$, and all those
of $f_2(x)$ are similarly greater than or equal to $m_k''$. Hence, we
have $\varphi(x) \ge m_k' + m_k''$ in $(x_{k-1}, x_k)$. We can show similarly that
$\varphi(x) \le M_k' + M_k''$ in $(x_{k-1}, x_k)$. We let $m_k$ and $M_k$ denote the
strict lower and upper bounds of $\varphi(x)$ in $(x_{k-1}, x_k)$, so that we have
$m_k \ge m_k' + m_k''$ and $M_k \le M_k' + M_k''$, whence it follows at once
that:

$$M_k - m_k \le (M_k' + M_k'') - (m_k' + m_k''),$$

i.e.

$$M_k - m_k \le (M_k' - m_k') + (M_k'' - m_k'').$$

Forming the sum (12) for $\varphi(x)$, we get:

$$0 \le \sum_{k=1}^{n} (M_k - m_k)\,\delta_k \le \sum_{k=1}^{n} (M_k' - m_k')\,\delta_k + \sum_{k=1}^{n} (M_k'' - m_k'')\,\delta_k.$$

Both sums on the right tend to zero, since $f_1(x)$ and $f_2(x)$ are inte-
grable by hypothesis. It follows that sum (12) for $\varphi(x)$, i.e.

$$\sum_{k=1}^{n} (M_k - m_k)\,\delta_k,$$

certainly tends to zero, i.e. $\varphi(x)$ is also integrable. The proof is easily
extended to the case of the algebraic sum of any finite number of
terms. Property VI of [94] is proved as before.

The proofs of the following properties are similar to the above: 

V. The product of two functions integrable in (a, b) is also integrable 
in (a, b). 

VI. If f(x) is integrable in (a, b), where its strict lower and upper 
bounds have the same sign, 1/f(x) is also integrable in (a, b). 

VII. If f(x) is integrable in (a, b), its absolute value | f(x) | is also 
integrable in (a, b). 

The inequality (10) of [95] can be proved as before. The proof of
property VII of [95] remains the same, if f(x) and $\varphi(x)$ are integrable
functions. The mean value theorem reads: if f(x) and $\varphi(x)$ are integrable
in (a, b), and $\varphi(x)$ preserves the same sign in the interval, then

$$\int_a^b f(x)\,\varphi(x)\,dx = \mu \int_a^b \varphi(x)\,dx,$$

where $\mu$ is a number satisfying $m \le \mu \le M$, and m and M are the
strict lower and upper bounds of f(x) in (a, b). In particular:

$$\int_a^b f(x)\,dx = \mu(b - a).$$

The proof is as before [95]. It can easily be shown, with the aid
of this formula, that

$$F(x) = \int_a^x f(t)\,dt$$

is a continuous function of x, and F'(x) = f(x) for all values of x at
which f(x) is continuous. We finally establish the basic formula for
the evaluation of integrals of integrable functions. Let $F_1(x)$ be con-
tinuous in (a, b), and let its derivative $F_1'(x) = f(x)$ for any x inside
(a, b), where f(x) is integrable in (a, b).

We have the basic formula in this case:

$$\int_a^b f(x)\,dx = F_1(b) - F_1(a).$$

On dividing the interval as usual, and applying the formula for
finite increments [63] to each sub-interval $(x_{k-1}, x_k)$, we can write:

$$F_1(x_k) - F_1(x_{k-1}) = F_1'(\xi_k)\,\delta_k = f(\xi_k)\,\delta_k \qquad (x_{k-1} < \xi_k < x_k). \qquad (15)$$

Further, summing over k and noting that (III of [116]):

$$\sum_{k=1}^{n} [F_1(x_k) - F_1(x_{k-1})] = F_1(b) - F_1(a),$$

we get:

$$F_1(b) - F_1(a) = \sum_{k=1}^{n} f(\xi_k)\,\delta_k.$$

This equation applies for any division of (a, b), in view of the nature
of the choice of the $\xi_k$, which is determined by the formula for
finite increments (15). We get the integral instead of the sum on
passing to the limit:

$$F_1(b) - F_1(a) = \int_a^b f(x)\,dx,$$

which it was required to prove. We remark that the values of f(x)
at the ends of (a, b) play no part in the definition of the integral, by
property I of the present section.
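
The key step of this proof, that the sum equals $F_1(b) - F_1(a)$ for every division once the $\xi_k$ are chosen by formula (15), can be tried on a concrete case. The Python sketch below is an added illustration and not the author's material. It assumes $F_1(x) = x^3$, $f(x) = 3x^2$ on (0, 2), where the point $\xi_k$ of (15) can be written in closed form; the names `F1`, `mean_value_point` and `points` are hypothetical.

```python
import math


def F1(x):
    return x ** 3                         # a primitive of f


def f(x):
    return 3 * x ** 2


def mean_value_point(left, right):
    """The xi of formula (15) for F1(x) = x^3 on 0 <= left < right.

    Solving 3*xi^2 = (right^3 - left^3)/(right - left) gives
    xi = sqrt((right^2 + right*left + left^2)/3).
    """
    return math.sqrt((right * right + right * left + left * left) / 3)


points = [0.0, 0.3, 0.5, 1.1, 1.7, 2.0]   # an arbitrary division of (0, 2)
total = 0.0
for left, right in zip(points, points[1:]):
    xi = mean_value_point(left, right)
    assert left < xi < right              # xi lies inside the sub-interval
    total += f(xi) * (right - left)

print(total, F1(points[-1]) - F1(points[0]))
```

The two printed numbers should agree up to rounding error, although the division has only five sub-interval and is far from fine; with any other choice of points in the sub-intervals the agreement would hold only in the limit.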


Find the integrals of the functions 1–20:
l.5a?49. 2. (6x? + 8x + 3). 3. z(xz4-a)(x-- b). 4. (a + br}. 
5. l (2px). 6.7". 7. (n) I^, 8, (a2ls — als). 
9. (Vx + 1) & — Vz + 1) 
10, Le | 
Vas 
1. (z" — z"ylyz. 12. (Va — Vx) Vaz 13. 1/(2? + 7). 
14. 1/(1? — 10). 15. 1//4 + a?. 16. 1//8 — 22. 
(2h a* — y2 — 2? 
17. == 
V/4—= a2 
18. tan? x. 19. coth? x. 20. 3% e”. 
Evaluate the integrals 21–25 by considering them as limits of sums:

21. $\int_a^b dx$. 22. $\int_0^T (v_0 + gt)\,dt$.

23. $\int_{-2}^{1} x^2\,dx$. 24. $\int_0^{10} 2^x\,dx$. 25. $\int_1^5 x^3\,dx$.

Find the derivatives with respect to x of the functions 26–29:
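26. $\int_1^x \log t\,dt$.

27. $\int_x^0 \sqrt{1 + t^4}\,dt$. 28. $\int_x^{x^2} e^{-t^2}\,dt$.

29. $\int_{1/x}^{\sqrt{x}} \cos(t^2)\,dt$.

30. Find the turning points of the function

$$y = \int_0^x \frac{\sin t}{t}\,dt \quad \text{for } x > 0.$$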


By evaluating the corresponding integrals, evaluate the limits 31–33:

31. $\lim_{n \to \infty} (1/n^2 + 2/n^2 + \dots + (n - 1)/n^2)$.

32. $\lim_{n \to \infty} [1/(n + 1) + 1/(n + 2) + \dots + 1/(n + n)]$.

33. lim 1 +2 eo ue PS), 


Evaluate the integrals 34—45: 





34. je — 2z--3)dz. 35. IE 2z + Vz) dz. 


4 
36. J (1+ Vy)/ly2dy. 37. f /z — 2 de. 





dz 
aa d e 39. | — 
Í V25 + 3x ie 1 
0 —2 
1 dz -1 d 
a y5dy 
40. f s RTI a. [2773 
0 —1 
dz dx 
alar 43. | ri — 3x +2 
1 sdz n[4 
E 45. f sec? z dz. 
0 n[6 


Find the indefinite integrals of the functions 46–122:
46. a/(a — x). 47. (2x + 3)/(2x + 1). 48. (1 — 32)/(3 + 2x) 
49. x/(a + bx). 50. (ax + b)/(a x + B). 51. (a? + 1)/(x — 1). 
32. (a?--5z--7)/(x--3). 93. (z* -3?-- 1)/ (x 





1). 54. [a + bx − a) P.
55. x/(x + 1)? 56. B//1−x. 57. Va − bx. 58. (x + 3)//22 − 4.
59, a/(x? — 5). 60. 2/(22? + 3). 61. (ax + b)/(a? x? + b2). 
62.2 a — x . 63. (1 + 25). 64. 2/28 — 1. 
65. (44 2?)71 arc tania. 66. [EE , 
67. ac^", 68. 47% 69. (e — e^*). 70. (e -+ 0792, 
TL a7* b7* (a* — b”). 72. a7 I" (g?* — 1). 73. ze Ot), 
74. x 1. 75. 020% 76, Gl] 77. e*[(e* — 1). 78. e* Ya — be. 
79. x (5 — a?) 5, 80. (x3 — 1)/(x4 — 4x + 1). 81. 23/(a8 + 5). 
82. ze". 83. [3 — /2 + 3233]/(2 + 32?). 84. (3? — 1)/(x + 1). 
85.0%. 86. (1 — sin 2)/(x + cos x). 87. (tan 3x — cot 3z)/sin 3z. 
2 2 24,4 .29y-la c 1 
88. l/(zlog?z) 89.sec?z(tan?z — 2) "^. 99, (2 -- 3554) "s 
91. a" cos x. 92, a? (a 41%, 93. z(1 — at), 
94. tan? ax. 95. sin? x. 96. sec? x (4 — tan? ax)" ^. 
97. sec (xja). 98. z-1 (1+ log za) *. 99. (x — 1)" tan Vx — 1. 
100. x cosec z?.. 101. (1 + 2?) forro tan * 4. zlog (1 +2?) + 1). 
102. (sin x — cos z)/(sin x + cos x). 103. (1 — sin yg) oseo ó : 
104. a?/(z? — 2). 105, (1 + x)%/a (1 + 22). 106. sin 2a - e? *, 
107. (b — 3x) (4 — 3a?) "^. 108. (e* + 1)71 
109. [(a + b) + (a — b) a?]71, 0 b <a. 110. e"/fezx — 1. 
111. sec ax + cosec ax. 112. sin (2z z/T + D,). 113. 1/[x (4 — log? x)]. 
114, (4 — 22%) aro cosi z. 115. secta e Um, 
116. sin x cos x (2 — sint x). 117. sec? x - cosec? x. 
118. (1 — 22)" (x + are sin x). 119. sec x tan x (sec? x + 1) ^*. 
120. cos 2x/(4 + cos? 2z). 121. 1/(1 + cos? 2). 
122. (1 + 22) ^ [log {x + yz? + 1)] ^. 


Find the indefinite integrals of the functions 123—140 by changing 
the variable of integration: 

123. (a) 7? (x? — 2) ^^, (x = #73); (b) (e* + 1)7*, (x = — log t); 
(c) x (52? — 3), (53? — 3 = t); (d) s/z 4-1, (t= Vx 4-15 
(e) cos z/l1 + sin? x, (f = sin 2). 

124. x (20 + 5)%. 125. (1 + z)/(1 + Vx). 126. 1/x fox + 1. 

127. (e* — 1)7*2, 128. 271 (log 2x/log 4x). 129. (1 — 2?) ^^ (arc sin z)*. 

130. e?” (e* + 1) ^. 131. sin? x (cos x). 132. 1/z VI + x. 

133. 27/1 — x3. 134. 29/2 — a. 135, 271 Ya? — a. 

136. 271 (a? — 1), 137. £71 (a? + 1). 138. 07? (4 — x) ^. 

139. YI − a3. 140. x (1 − 2) ^^, (Put $x = \sin^2 t$).

Find the indefinite integrals of the functions 141–160 by integrating
by parts: 


141. log x. 142. arc tan x. 143. arc sin x. 144. xsin x. 145. x cos 3x. 

146. xe *. 147. x - 27*. 148. x? e™. 149. (x? — 2x + 5) e. 

150. 2? e^*?, 151. z sin æ cos x. 152. (a? +- 6x + 6) cos 2x. 153. 2? log x. 

154. log? x. 155.173 log x. 156.0 ** log a. 157. ware tan x. 158. xarc 
sin x. 

159. log [x + V1 + 22]. 160. x - cosec? x. 

Find the indefinite integrals of the functions 161—218: 

161. (a? + 2x + 5)71, 162. (a? + 20)71, 163. (32? — x + 1)71. 

164. x (a? — Tx + 183) 1. 165. (3x — 2)/(aà? — 4x + 5). 

166. (x — 1)2/(a? + 3x + 4). 167. a?/(a? — 6x + 10). 

168. (2 + 3x — 22?) *. 169. (x — a?) ^. 170. (a? + pz + q”. 

171. (a? + ax + bx + ab)71. 172. (x? — 5x + 9)/(a? — 5a + 6). 

173. [(x + 1) (x + 2) (x + 3)]71. 

174. (22? + 41 x — 91)/(x — 1) (x + 3) (x — 4). 

175. (5x? + 2)/(x? — 5a? + 4x). 176. x 1(x + 1)7?. 

177. (x3 — 1)/(4a3 — x). 178. (xt — 62? + 122? + 6)/(a3— 62?-- 124—8) 

179. (52? + 6x + 9)/(x — 3)? (x + 1). 

180. (a? — 8x + 7)/(x? — 3x — 10). 181. (2x — 3)/(a? — 3x + 2)?. 

182. (x? + x + 1)/x (2? + 1). 183. 21/(x* — 1). 

184. (x? — 4x + 3)71 (a? + 4x + 5)71, 185. cos? z. 186. sin? x. 

187. sin? x cos? x. 188. sin? ig. cos ig. 189. cosó zx - cosec? x. 

190. sint x. 191. sin? x cos? x. 192. sin? x cost x. 193. cos? 3x. 

194. cosec* z. 195. sec? x. 196. cos? x - cosec? x. 197. cosec? g. sect x. 

198. cosec5 x - sec? z. 199. cosec įg- sec?5 a. 

200. sin (x + im cosec z-seo z. 201. cosec? x. 202. secó 4x. 

203. tan? 5x. 204. cot? x. 205. cott x. 206. tan? (2/3) + tant (2/4). 

207. v sin?a?. 208. sinë x cos” x. 209. cosec x sec”? æ. 210. cot"? x. 

211. sin 3x cos 5x. 212. sin 10x sin 15x. 213. cos 1zcos iv. 

214. sin  zcos?z. 215. cos (ax + b) cos (ax — b). 

216. sin c x sin (w x + y). 217. cos x cos? 3x. 218. sin x sin 2z sin 3x. 


Find the indefinite integrals of the functions 219–227 by making the
change of variable $\tan \tfrac{1}{2}x = t$ and integrating with respect to t:


219. (3 4- 5 cos x)~1. 220. (sin x + cos x)~1. 221. cos x/(1 + cos x). 
222. sin z/(1 — sin x). 223. (8 — 4 sin x + 7 cos x) 3. 

224. (cos z + 2 sin x + 3)71. 225. (3 sin x + 2c0s z)/(2 sin x + 3 cos x) 
226. $(1 + \tan x)/(1 - \tan x)$. 227. $(1 + 3\cos^2 x)^{-1}$.

Find the indefinite integrals of the functions 228–233 by changing
the variable of integration by a substitution involving hyper- 
bolic functions: 

228. (3 — 2x — 2°)'*, 229. V2 F 22. 230. a? (9 + 22) ^". 

231. Va? — da F 2. 232. Vx? — 4. 233. Ja? rz. 

Examine for convergence (and in the cases when the integrals are 
convergent evaluate) the definite integrals 234—254: 

1 2 1 3 





























da da de dz 
234. [—  — 235. [ 7. 236. | — 237. [4 
0 —1 0 0 
1 oo œ oo 
dx dz dz dx 
238. | s 239. — 249. [E 241. | E 
0 1 1 1 
(dz s [2 2M. Fain ade. 
mo. o 28. Larra jane 
Ya de SA 
245. IE 246. Tess 247. figs a> l) 
9o lign 
248, I (a > 1) 249. [cota dz. 
a 0 
250. [e*dz (k> 0) 251, |A de 
0 0 
eo oo 1 
dz da 
252. J Gol aut x ip M EE 
Investigate the convergence of the integrals 255-261:
100 
AN: n 256. = 
3 4 3 
Va + 2/04 i i 1+5 
( du 
257. 258. | ===" 
== 1 
E m e S [ir TES 
dz sin x 
259. ic 260. i ee 261. [HZ a 
6 = — gi 1 2/2 


262. Prove that Euler's integral of the first kind (the Beta function)
B(p, q), defined by the equation

$$B(p, q) = \int_0^1 x^{p-1} (1 - x)^{q-1}\,dx,$$

is convergent if p > 0, q > 0.

263. Prove that Euler's integral of the second kind (the Gamma
function) Γ(p), defined by the equation

$$\Gamma(p) = \int_0^\infty x^{p-1} e^{-x}\,dx,$$

is convergent if p > 0.

264. Find the area bounded by the parabola $y = 4x - x^2$ and the
x-axis.

265. Find the area bounded by the curve y = log x, the x-axis and 
the line x= e. 

266. Find the area bounded by the curve $y = x(x - 1)(x - 2)$ and
the x-axis.

267. Calculate the area bounded by the curve $y^3 = x$ and the lines
y = 1 and x = 8.

268. Calculate the area above the x-axis of one arc of the sinusoidal
curve y = sin x.

269. Calculate the area bounded by the curve y = tan x, the z-axis 
and the line x = j'z. 

270. Find the area bounded by the hyperbola $xy = m^2$, the lines
x = a, x = 3a (a > 0) and the x-axis.

271. Find the area bounded by the Witch of Agnesi $y = a^3/(x^2 + a^2)$
and the x-axis.

272. Find the area of the figure bounded by the cubic curve $y = x^3$,
the line y = 8 and the y-axis.

273. Find the area bounded by the parabolas $y^2 = 2px$ and $x^2 = 2py$.

274. Calculate the area bounded by the parabola $y = 2x - x^2$ and
the line y = -x.

275. Calculate the area bounded by the line y = 3 - 2x and the
parabola $y = x^2$.

276. Find the area of the figure bounded by the parabolas $y = x^2$,
$y = \tfrac{1}{2}x^2$ and the line y = 2x.

277. Find the area bounded by the curve $a^2 y^2 = x^2 (a^2 - x^2)$.

278. Find the area enclosed by the curve $(x/5)^2 + (y/4)^{2/3} = 1$.

279. Find the area contained within the astroid $x = a\cos^3 t$,
$y = b\sin^3 t$.

280. Find the area bounded by the x-axis and one arc of the cycloid
x = a (t - sin t), y = a (1 - cos t).

281. Find the area bounded by one branch of the trochoid
x = at - b sin t, y = a - b cos t (0 < b < a), and the tangent at
its lowest point.

282. Find the area bounded by the cardioid x = a (2 cos t - cos 2t),
y = a (2 sin t - sin 2t).
283. Find the area of a loop of the folium of Descartes:

$$x = \frac{3at}{1 + t^3}; \quad y = \frac{3at^2}{1 + t^3}.$$

284. Find the area of the cardioid $r = a(1 + \cos\theta)$.

285. Find the area of one leaf of the curve $r = a\cos 2\theta$.

286. Find the total area enclosed by the curve $r^2 = a^2 \sin 4\theta$.

287. Find the area contained within the curve $r = a\sin 3\theta$.

288. Find the area bounded by Pascal's Limaçon $r = 2 + \cos\theta$.

289. Find the area bounded by the parabola
$r = a\sec^2 \tfrac{1}{2}\theta$ and the lines $\theta = \tfrac{1}{4}\pi$ and $\theta = \tfrac{1}{2}\pi$.

290. Find the area of the ellipse $r = p/(1 + e\cos\theta)$, (e < 1).

291. Find the area bounded by the curve $r = 2a\cos 3\theta$ and lying
outside the circle r = a.

292. Find the area contained within the curve $x^4 + y^4 = x^2 + y^2$.

293. Calculate the length of the arc of the semi-cubical parabola
$y^2 = x^3$ from the origin to the point (4, 8).

294. Find the length of the arc of the parabola $y = 2\sqrt{x}$ from x = 0
to x = 1.

295. Find the length of the part of the curve $y = e^x$ lying between
the points (0, 1) and (1, e).

296. Find the length of the curve y = log x from $x = \sqrt{3}$ to $x = \sqrt{8}$.

297. Find the length of the curve $y = \arcsin(e^{-x})$ from x = 0 to x = 1.

298. Calculate the length of the arc of the curve x = log sec y
contained between y = 0 and $y = \tfrac{1}{3}\pi$.

299. Find the length of the curve $x = \tfrac{1}{4}y^2 - \tfrac{1}{2}\log y$ from y = 1 to
y = e.

300. Find the perimeter of the closed loop of the curve
$9ay^2 = x(x - 3a)^2$.

301. Find the length of the arc of the involute of the circle
x = a (cos t + t sin t), y = a (sin t - t cos t) from t = 0 to t = T.

302. Find the length of the evolute of the ellipse

$$x = (c^2/a)\cos^3 t, \quad y = (c^2/b)\sin^3 t, \quad (c^2 = a^2 - b^2).$$

303. Find the length of the curve
x = a (2 cos t - cos 2t), y = a (2 sin t - sin 2t).

304. Find the length of a complete turn of the Archimedean spiral
$r = a\theta$.

305. Find the length of the perimeter of the cardioid $r = a(1 + \cos\theta)$.

306. Find the length of the part of the parabola $r = a\sec^2 \tfrac{1}{2}\theta$ cut
off from the parabola by a vertical line through the pole.

307. Find the volume of the solid obtained by rotating about the x-axis
the part of the parabola $y = ax - x^2$ which lies above the x-axis.

308. Find the volume of the solid obtained by rotating the ellipse
$x^2/a^2 + y^2/b^2 = 1$ about the x-axis.

309. Find the volume generated by rotating about the x-axis the arc
of the curve $y = \sin^2 x$ lying between x = 0 and x = π.

310. Find the volume generated by rotating the figure bounded by
the x-axis, the semi-cubical parabola $y^2 = x^3$ and the ordinate
x = 1 (a) about the x-axis, (b) about the y-axis.

311. Find the volume generated by rotating about the y-axis the
area bounded by the parabola $y^2 = 4ax$ and the line x = a.

312. Find the volume generated by rotating about the x-axis the
figure bounded by the curves $y = x^2$ and $y = \sqrt{x}$.

313. Find the volume generated by rotating, about the x-axis, the
closed loop of the curve $(x - 4a)y^2 = ax(x - 3a)$ (a > 0).

314. Find the volume traced out by rotating the curve $y^2 = x^3/(2a - x)$
about its asymptote x = 2a.

315. Find the volume of a paraboloid of revolution of radius of base
R and height H.

316. A plane area bounded by a segment of a parabola of base 2a
and height h is rotated about its base. Determine the volume
of the solid so obtained (Cavalieri's “lemon”).

317. Show that the volume cut off by the plane x = 2a from the
solid obtained by rotating the hyperbola $x^2 - y^2 = a^2$ about the
x-axis is equal to that of a sphere of radius a.

318. Find the volume of the solid obtained by revolving the figure
bounded by one arc of the cycloid x = a (t - sin t),
y = a (1 - cos t) and the x-axis about (a) the x-axis, (b) the y-axis,
(c) the axis of symmetry of the figure.

319. Find the volume of the solid generated by revolving the astroid
$x = a\cos^3 t$, $y = b\sin^3 t$ about the y-axis.

320. Find the volume of the solid obtained by revolving the curve
$r = a\cos^2\theta$ about the polar axis.

321. Find the volume of the solid obtained by revolving the curve
$r = a(1 + \cos\theta)$ about the polar axis.

322. Find the volume of the solid cut off from the hyperboloid
$x^2/a^2 + y^2/b^2 - z^2/c^2 = 1$ by the planes z = 0, z = h.

323. Find the volume of the ellipsoid $x^2/a^2 + y^2/b^2 + z^2/c^2 = 1$.

324. Find the area of the surface of the “spindle” which is formed
by rotating one arc of the curve y = sin x about the x-axis.

325. Find the area of the surface formed by rotating about the x-axis
the arc of the curve y = tan x lying between x = 0 and $x = \tfrac{1}{4}\pi$.

326. Find the area of the surface obtained by rotating the curve
$y = e^{-x}$ (0 < x < ∞) about the x-axis.

327. Find the area of the surface generated by the rotation about
the x-axis of the arc of the catenary y = a cosh (x/a) lying
between x = 0 and x = a.

328. Find the area of the surface obtained by rotating the astroid
$x^{2/3} + y^{2/3} = a^{2/3}$ about the y-axis.

329. Find the area of the surface obtained by rotating about the
x-axis the part of the curve $x = \tfrac{1}{4}y^2 - \tfrac{1}{2}\log y$ which lies between
y = 1 and y = e.

330. Find the surface area of the torus obtained by rotating the
circle $x^2 + (y - b)^2 = a^2$ (b > a) about Ox.

331. Find the surface areas of the solids obtained by rotating the
ellipse $x^2/a^2 + y^2/b^2 = 1$ about (1) the x-axis; (2) the y-axis
(a > b).

332. Find the areas of the surfaces generated by rotating one arc
of the cycloid x = a (t - sin t), y = a (1 - cos t) about (a) the
x-axis; (b) the y-axis; (c) the tangent to the cycloid at its highest
point.

333. Find the area of the surface obtained by rotating about the
x-axis the cardioid with parametric equations
x = a (2 cos t - cos 2t), y = a (2 sin t - sin 2t).

334. Find the statical moments with respect to the coordinate axes
of the segment of the line x/a + y/b = 1 cut off by the coordinate
axes.

335. Find the statical moments of a rectangle of sides a and b with
respect to its sides.

336. Find the statical moments with respect to the coordinate axes
and the coordinates of the centroid of the area bounded by
x + y = a, x = 0, y = 0.

337. Find the statical moments with respect to the coordinate axes 
and the coordinates of the centroid of the area in the first quadrant 
bounded by the astroid. 

338. Find the statical moment of the circle $r = 2a\sin\theta$ with respect
to the polar axis.

339. Find the centroid of the arc of a circle of radius a subtending
an angle 2α at the centre of the circle.

340. Find the coordinates of the centroid of one arc of the cycloid 
x=a(t—sint), y = a (1 — cos t). 

341. Find the coordinates of the centroid of the area in the first
quadrant bounded by the coordinate axes and the ellipse $x^2/a^2 + y^2/b^2 = 1$.

342. Find the coordinates of the centroid of the area bounded by the
parabolas $y = x^2$, $y = \sqrt{x}$.

343. Find the coordinates of the centroid of the area bounded by
one arc of the cycloid x = a (t - sin t), y = a (1 - cos t).

344. Find the centroid of a hemispherical shell of radius a, taking
the centre of the sphere to be the origin of coordinates and the
plane face to be the plane xOy.

345. Find the centroid of a homogeneous right circular cone of height h, 
taking the origin of coordinates at the apex and the z-axis along 
the axis of symmetry. 

346. Find the centroid of a homogeneous hemisphere of radius a, 
taking the same coordinate system as in 344. 

347. Find the moment of inertia of a circular hoop of radius a about 
a diameter. 

348. Find the moments of inertia of a rectangle of sides a, b about 
its sides. 

349. Find the moment of inertia of the plane area bounded by a 
parabolic segment with base 2b and height h about its axis of 
symmetry. 

350. Find the moments of inertia of the area bounded by the ellipse
$x^2/a^2 + y^2/b^2 = 1$ about its principal axes.

351. Find the moment of inertia of a circular ring of radii $r_1$, $r_2$
($r_2 > r_1$) about an axis through its centre and normal to its plane.

352. Find the moment of inertia of a homogeneous right circular 
cone of radius r and height h about its axis. 

353. Find the moment of inertia of a solid sphere of radius a and 
mass m about a diameter. 

354. Find the volume and surface area of the torus obtained by 
rotating a circle of radius a about an axis in the same plane and 
distant b (> a) from the centre of the circle. 

355. (a) Find the centroid of a semicircle using Guldin's theorem.
(b) Using Guldin's theorem, prove that the centroid of a triangle
is at distance $\tfrac{1}{3}h$ from the base, where h is the height of the
triangle.


CHAPTER IV 


SERIES. APPLICATIONS TO APPROXIMATE EVALUATIONS


§ 12. Basic theory of infinite series


118. Infinite series. Let an infinite sequence of numbers be given:

$$u_1, u_2, u_3, \ldots, u_n, \ldots \tag{1}$$

By taking the sum of the first n terms

$$s_n = u_1 + u_2 + \ldots + u_n,$$

we can get another infinite sequence of numbers

$$s_1, s_2, s_3, \ldots, s_n, \ldots \tag{2}$$

If $s_n$ tends to a (finite) limit on indefinite increase of n:

$$s = \lim_{n \to \infty} s_n,$$

we say that the infinite series

$$u_1 + u_2 + u_3 + \ldots + u_n + \ldots \tag{3}$$

converges and has the sum s, and we write:

$$s = u_1 + u_2 + u_3 + \ldots + u_n + \ldots \tag{4}$$

On the other hand, if $s_n$ does not tend to a limit, we say that the infinite
series (3) diverges.

In other words the infinite series (3) is said to converge, if the sum of
its first n terms tends to a limit on indefinite increase of n. We call this
limit the sum of the series.


We can only talk about the sum of an infinite series when it is
convergent, in which case the sum $s_n$ of the first n terms gives an
approximate expression for the sum s of the series. The error $r_n$ involved
in this approximation, i.e. the difference

$$r_n = s - s_n,$$

is called the remainder of the series.

The remainder $r_n$ is evidently itself the sum of an infinite series,
obtained from the given series (3) by neglecting its first n terms:

$$r_n = u_{n+1} + u_{n+2} + \ldots + u_{n+p} + \ldots$$

This remainder cannot be found accurately in the majority of cases,
so that it is important to know the approximate error due to this
remainder.

The simplest example of an infinite series is the geometrical progression:

$$a + aq + aq^2 + \ldots + aq^{n-1} + \ldots \quad (a \neq 0) \tag{5}$$

We consider separately the cases:

$$|q| < 1, \quad |q| > 1, \quad q = 1, \quad q = -1.$$

We know [27] that a geometrical progression has a finite sum
$s = a/(1 - q)$ for $|q| < 1$, so that the series is convergent; here, in fact:

$$s_n = a + aq + \ldots + aq^{n-1} = \frac{a - aq^n}{1 - q},$$

and $s - s_n \to 0$ as $n \to \infty$, since $q^n \to 0$ for $|q| < 1$ [26]. Obviously,
$s_n \to \infty$ for $n \to \infty$ with $|q| > 1$, since now $q^n \to \infty$ [29]. We have
$s_n = an$ for q = 1, and evidently $s_n \to \infty$ again, so that a geometrical
progression is divergent for $|q| > 1$ and q = 1. We get the series,
with q = -1:

$$a - a + a - a + \ldots$$

The sum $s_n$ of its first n terms is zero for n even, and a for n odd,
i.e. $s_n$ does not tend to a limit, and the series is divergent; though
contrary to the previous case, the sum remains bounded for all n,
since it only takes the values 0 and a.
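
The four cases can be checked numerically. The following is a minimal sketch, not part of the original text: the function names and the sample values a = 3 and q = 0.5, -0.5, 2, 1, -1 are illustrative assumptions.

```python
def geometric_partial_sum(a: float, q: float, n: int) -> float:
    """Sum of the first n terms a + a*q + ... + a*q**(n-1), added term by term."""
    total = 0.0
    term = a
    for _ in range(n):
        total += term
        term *= q
    return total


def closed_form(a: float, q: float, n: int) -> float:
    """Closed form (a - a*q**n) / (1 - q) for the same sum, valid for q != 1."""
    return (a - a * q**n) / (1 - q)


a = 3.0  # illustrative first term (assumption)
for q in (0.5, -0.5, 2.0, 1.0, -1.0):
    sums = [geometric_partial_sum(a, q, n) for n in (1, 2, 10, 50)]
    print(f"q = {q:5.1f}: s_1, s_2, s_10, s_50 = {sums}")

# For |q| < 1 the partial sums approach a / (1 - q).
print("a / (1 - q) for q = 0.5:", a / (1 - 0.5))
```

For q = -1 the printed sums alternate between a and 0, which is the bounded but divergent behaviour described above.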

If the absolute value of $s_n$, the sum of the first n terms of series (3),
tends to infinity on indefinite increase of n, series (3) is said to be
strictly divergent. In future, we shall simply speak of a “divergent
series” for brevity, when a strictly divergent series is meant.


119. Basic properties of infinite series. Convergent infinite series
have certain properties that allow operations to be performed on
them as on finite sums.

I. If the series

$$u_1 + u_2 + \ldots + u_n + \ldots$$

has the sum s, the series

$$au_1 + au_2 + \ldots + au_n + \ldots \tag{6}$$

obtained by multiplying all the terms of the first series by the same factor
a, has the sum as, because the sum $\sigma_n$ of the first n terms of (6) is

$$\sigma_n = au_1 + au_2 + \ldots + au_n = as_n,$$

and hence

$$\lim_{n \to \infty} \sigma_n = \lim_{n \to \infty} as_n = a \lim_{n \to \infty} s_n = as.$$


II. Convergent series can be added and subtracted term by term, i.e. if

$$u_1 + u_2 + \ldots + u_n + \ldots = s,$$
$$v_1 + v_2 + \ldots + v_n + \ldots = \sigma,$$

the series

$$(u_1 \pm v_1) + (u_2 \pm v_2) + \ldots + (u_n \pm v_n) + \ldots \tag{7}$$

is also convergent, and its sum is $(s \pm \sigma)$, since the sum of the first n
terms of (7) is

$$(u_1 \pm v_1) + (u_2 \pm v_2) + \ldots + (u_n \pm v_n) = s_n \pm \sigma_n.$$

Other properties of sums, for instance the independence of the sum
of the order of the terms, the rule for multiplying two sums,
etc., will be considered in regard to infinite series in § 14 below. We
remark for the present that these properties do not hold for every
series. The associative law is evidently valid for any convergent series,
i.e. we can group adjacent terms of the series as desired. This amounts to
taking only some of the partial sums $s_n$ instead of all the $s_n$ (n = 1, 2, 3, ...), which
does not alter the limit s.

III. The convergence or divergence of a series is unaltered by removing
or adding a finite number of terms at the beginning. We shall take the
two series:

$$u_1 + u_2 + u_3 + u_4 + \ldots$$
$$u_3 + u_4 + u_5 + u_6 + \ldots$$

The second is derived from the first by removing the first two terms.
Let the sum of the first n terms of the first series be denoted by $s_n$,
and of the second series by $\sigma_n$. Evidently:

$$\sigma_{n-2} = s_n - (u_1 + u_2), \quad s_n = \sigma_{n-2} + (u_1 + u_2);$$

whilst the subscript $(n - 2) \to \infty$ as $n \to \infty$. Hence, if $s_n$ has a limit,
$\sigma_{n-2}$ also has a limit, and conversely. These limits, s and σ, i.e. the
sums of the above two series, will of course be different, and in fact:

$$\sigma = s - (u_1 + u_2).$$

IV. The general term $u_n$ of a convergent series tends to zero on
indefinite increase of n:

$$\lim u_n = 0, \tag{8}$$

since obviously,

$$u_n = s_n - s_{n-1},$$

and if the series converges and has the sum s,

$$\lim s_{n-1} = \lim s_n = s,$$

so that

$$\lim u_n = \lim s_n - \lim s_{n-1} = s - s = 0.$$

Condition (8) is thus necessary for the convergence of a series, but
it is not sufficient: a series as a whole can diverge, whilst its general
term tends to zero.


Example. The harmonic series is:

$$1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \ldots + \frac{1}{n} + \ldots = \sum_{n=1}^{\infty} \frac{1}{n}. \tag{9}$$

We have here:

$$u_n = \frac{1}{n} \to 0 \quad \text{for } n \to \infty.$$

It can easily be shown, however, that the sum of the first n terms of (9)
increases indefinitely. We take the terms in groups of 1, 2, 4, 8, ... terms,
starting with the second:

$$1 + \frac{1}{2} + \left(\frac{1}{3} + \frac{1}{4}\right) + \left(\frac{1}{5} + \frac{1}{6} + \frac{1}{7} + \frac{1}{8}\right) + \left(\frac{1}{9} + \ldots + \frac{1}{16}\right) + \ldots,$$

so that the kth group contains $2^{k-1}$ terms. If we replace all the terms of each
group by the last, this being the smallest of the group, the resulting series:

$$1 + \frac{1}{2} + \frac{1}{2} + \frac{1}{2} + \ldots \tag{10}$$

evidently tends to (+∞), the sum of its first n terms being $[1 + \tfrac{1}{2}(n - 1)]$.
We can take any desired number n of groups by taking a sufficiently large
number of terms of (9), and the sum of these terms of (9) is not less than
$[1 + \tfrac{1}{2}(n - 1)]$, whence it follows that $s_n \to +\infty$ for the series (9).
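
The grouping argument can be checked numerically. The sketch below is illustrative only; the function names are assumptions, and `groups` counts complete groups after the leading term, so that it corresponds to n - 1 in the bound above. The first term together with `groups` complete groups makes up the first 2**groups terms of (9).

```python
def harmonic_partial_sum(m: int) -> float:
    """s_m = 1 + 1/2 + ... + 1/m."""
    return sum(1.0 / k for k in range(1, m + 1))


def grouping_lower_bound(groups: int) -> float:
    """Bound 1 + groups/2 obtained by replacing each term of a group by the last one."""
    return 1.0 + 0.5 * groups


for groups in (1, 2, 5, 10, 15):
    m = 2**groups  # index of the last term of the last complete group
    print(groups, m, harmonic_partial_sum(m), grouping_lower_bound(groups))
```

The partial sums stay above the bound, which itself grows without limit, although the growth is slow: doubling the number of terms adds at least 1/2 to the sum.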
120. Series with positive terms. Tests for convergence. Series with
positive (non-negative) terms have a special importance; here, all
the terms

$$u_1, u_2, u_3, \ldots, u_n, \ldots \geq 0.$$


We establish a number of tests for convergence and divergence of 
such series. 
1. A series with positive terms can only be convergent, or else strictly
divergent, i.e.

$$s_n \to s \quad \text{or} \quad s_n \to +\infty.$$

A necessary and sufficient condition for the convergence of a series with
positive terms is that the sum $s_n$ of its first n terms remains less than a
given constant A for any n.

The sum $s_n$ of the series cannot decrease as n increases, since new
positive (non-negative) terms are then added; all our assertions now
follow from the properties of increasing variables worked out in [30].

It is often useful, when discussing the convergence or divergence of 
a series with positive terms, to compare it with other, simpler series, 
and especially with a geometrical progression. 

We therefore establish the following test:

2. If each term of a series with positive terms

$$u_1 + u_2 + \ldots + u_n + \ldots \tag{11}$$

as from some given term, does not exceed the corresponding term of a
convergent series

$$v_1 + v_2 + \ldots + v_n + \ldots \tag{12}$$

the given series is convergent.

If, on the contrary, each term of (11), as from a given n, is not less
than the corresponding term of a divergent series (12), the given series is
also divergent.

We first take

$$u_n \leq v_n \tag{13}$$

with series (12) convergent. We can suppose without loss of generality
that this inequality is true for all n, since the first terms to which it
does not apply can be neglected if necessary (property III [119]).
Let the sum of the first n terms of (11) be denoted by $s_n$, and the
corresponding sum for (12) by $\sigma_n$; then we have by (13):

$$s_n \leq \sigma_n.$$

But (12) converges by hypothesis, and denoting its sum by σ,
we have:

$$\sigma_n \leq \sigma,$$

so that we also have:

$$s_n \leq \sigma,$$

whence the convergence of (11) follows, by 1.

We now take

$$u_n \geq v_n. \tag{14}$$

We evidently have

$$s_n \geq \sigma_n; \tag{15}$$

but series (12) is now divergent, and the sum $\sigma_n$ of its first n terms
can be made greater than any previously assigned large number;
by (15), $s_n$ has the same property, i.e. series (11) is also divergent.

Remark. The convergence (divergence) of series (12) implies the
convergence (divergence) of the series

$$kv_1 + kv_2 + kv_3 + \ldots + kv_n + \ldots$$

where k is any positive constant number.

The convergence of the series $\sum kv_n$ follows from the convergence
of $\sum v_n$ by I [119]. Conversely, if $\sum v_n$ is divergent, $\sum kv_n$ must be
divergent, since, supposing it were convergent, and multiplying its
terms by 1/k, we should have the convergence of $\sum v_n$ by I [119].
It follows from what has been said that:

Series (11) is convergent, if

$$u_n \leq kv_n, \tag{16}$$

where the series $\sum v_n$ is convergent and k is any positive number; (11) is
divergent, if

$$u_n \geq kv_n, \tag{17}$$

where $\sum v_n$ is divergent.
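
As a numerical illustration of test 2, the following sketch compares a series with a convergent geometrical progression. The choice $u_n = 1/(n \cdot 2^n)$, the comparison series $v_n = (1/2)^n$ and the helper name `partial_sums` are assumptions made for the example and do not come from the text.

```python
import math


def partial_sums(term, n_terms):
    """Return the list s_1, ..., s_N of partial sums of term(1) + term(2) + ..."""
    sums, total = [], 0.0
    for n in range(1, n_terms + 1):
        total += term(n)
        sums.append(total)
    return sums


u = lambda n: 1.0 / (n * 2**n)  # series under test (illustrative choice)
v = lambda n: 0.5**n            # convergent geometrical progression with sum 1

s = partial_sums(u, 60)
sigma = partial_sums(v, 60)

# Since u_n <= v_n for every n, each s_n is bounded by sigma_n, and hence by 1.
print("s_60     =", s[-1])
print("sigma_60 =", sigma[-1])
print("log 2    =", math.log(2))  # the known sum of this particular series
```

The partial sums $s_n$ never exceed $\sigma_n$, so they remain below the sum of the comparison series, as the proof requires.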

By comparing a given series with a geometrical progression, we 
obtain two fundamental tests for the convergence of series with 
positive terms. 


121. Cauchy's and d'Alembert's tests.
3. Cauchy's test. If the general term of the series (11) with positive
terms:

$$u_1 + u_2 + \ldots + u_n + \ldots$$

satisfies, as from some given n:

$$\sqrt[n]{u_n} \leq q < 1, \tag{18}$$

where q does not depend on n, the series is convergent.

On the other hand, if we have, as from a given n,

$$\sqrt[n]{u_n} \geq 1, \tag{19}$$

series (11) is divergent.

We can assume that inequality (18) or (19) is satisfied for all n
without loss of generality (property III [119]). If (18) is satisfied,

$$u_n \leq q^n,$$

i.e. the general term of the given series does not exceed the corresponding
term of an infinite diminishing geometrical progression, whence,
by test 2, the series is convergent. Whereas we have in the case of (19):

$$u_n \geq 1,$$

and series (11) has a general term which does not tend to zero (it
is not less than unity), so that the series cannot be convergent
(property IV [119]).

4. d'Alembert's test. If the ratio $u_n/u_{n-1}$ of two successive terms of a
series satisfies the inequality, as from a given n:

$$\frac{u_n}{u_{n-1}} \leq q < 1, \tag{20}$$

where q does not depend on n, series (11) is convergent.

On the other hand, if we have, as from a given n:

$$\frac{u_n}{u_{n-1}} \geq 1, \tag{21}$$

the given series is divergent.

We can make the same sort of assumption as before, that inequality
(20) or (21) is satisfied for all n; we have in the case of (20):

$$u_n \leq u_{n-1} q, \quad u_{n-1} \leq u_{n-2} q, \quad \ldots, \quad u_2 \leq u_1 q,$$

whence, multiplying term by term and cancelling the common
factors,

$$u_n \leq u_1 q^{n-1},$$

i.e. the terms of the series do not exceed the terms of the diminishing
geometrical progression:

$$u_1 + u_1 q + u_1 q^2 + \ldots + u_1 q^{n-1} + \ldots \quad (0 < q < 1),$$

and (11) therefore converges, by test 2. In the case of (21):

$$u_1 \leq u_2 \leq u_3 \leq \ldots \leq u_{n-1} \leq u_n \leq \ldots,$$

i.e. the terms of the series do not decrease, so that $u_n$ does not tend
to zero for $n \to \infty$, and the series cannot be convergent (property IV
[119]).


COROLLARY. If

$$\sqrt[n]{u_n} \quad \text{or} \quad \frac{u_n}{u_{n-1}} \tag{22}$$

tends to a finite limit r on indefinite increase of n, the series

$$u_1 + u_2 + \ldots + u_n + \ldots$$

is convergent for r < 1, and divergent for r > 1.

Suppose first r < 1. We take a sufficiently small number ε such
that also

$$r + \varepsilon < 1.$$

For large n, $\sqrt[n]{u_n}$ or $u_n/u_{n-1}$ will differ from the limit r by less
than ε, i.e. we shall have, starting from some sufficiently large n:

$$r - \varepsilon < \sqrt[n]{u_n} < r + \varepsilon < 1 \tag{23$_1$}$$

or

$$r - \varepsilon < \frac{u_n}{u_{n-1}} < r + \varepsilon < 1. \tag{23$_2$}$$

Applying Cauchy's or d'Alembert's test with q = r + ε < 1, we
can infer at once the convergence of the given series from (23$_1$) or (23$_2$).
The proof of divergence is similar, in the case when r > 1. The series
is evidently also divergent if one of expressions (22) tends to (+∞).
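
The limit r in the corollary can be estimated numerically by evaluating the two expressions (22) for increasing n. The sketch below is illustrative only: the series $u_n = n/3^n$ and the function names are assumptions, chosen so that both expressions tend to 1/3.

```python
def ratio(term, n):
    """u_n / u_(n-1), the quantity used in d'Alembert's test."""
    return term(n) / term(n - 1)


def nth_root(term, n):
    """The n-th root of u_n, the quantity used in Cauchy's test."""
    return term(n) ** (1.0 / n)


# Illustrative series (an assumption, not from the text): u_n = n / 3**n.
u = lambda n: n / 3.0**n

for n in (5, 50, 500):
    print(n, ratio(u, n), nth_root(u, n))
```

Both columns approach 1/3 < 1, so the corollary gives convergence; the ratio settles down noticeably faster than the root in this example.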


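Examples. 1. Consider the series

$$\frac{x}{1!} + \frac{x^2}{2!} + \ldots + \frac{x^n}{n!} + \ldots = \sum_{n=1}^{\infty} \frac{x^n}{n!}. \tag{24}$$

Applying d'Alembert's test:

$$u_n = \frac{x^n}{n!}, \quad u_{n-1} = \frac{x^{n-1}}{(n-1)!}, \quad \frac{u_n}{u_{n-1}} = \frac{x}{n} \to 0 \text{ as } n \to \infty,$$

so that the given series converges for all finite (positive) x.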
2. Consider the series 


(25)

Here we have:

so that, by d'Alembert's test, the given series converges for 0 < x < 1 and
diverges for x > 1.
3. Consider the series

$$\sum_{n=1}^{\infty} r^n \sin^2 n\alpha. \tag{26}$$

We have, on applying Cauchy's test:

$$u_n = r^n \sin^2 n\alpha, \quad \sqrt[n]{u_n} = r \sqrt[n]{\sin^2 n\alpha} \leq r,$$

so that the series is convergent for r < 1.

d'Alembert's test gives no result in this case, since

$$\frac{u_n}{u_{n-1}} = r \left( \frac{\sin n\alpha}{\sin (n - 1)\alpha} \right)^2$$

neither tends to a limit, nor remains always < 1 or > 1.

It can be shown that, in general, Cauchy’s test is stronger than d’Alembert’s 
test, i.e. Cauchy’s test can always be applied when d’Alembert’s can, and 
often when it cannot. On the other hand, d’Alembert’s test is easier to use, 
as becomes evident from an inspection of the first two examples worked 
out above. 

We remark further that there are cases when both Cauchy's and
d'Alembert's tests fail, as, for instance, whenever

$$\sqrt[n]{u_n} \to 1 \quad \text{and} \quad \frac{u_n}{u_{n-1}} \to 1,$$

i.e. when r = 1. We are concerned here with a doubtful case, when the
question of convergence or divergence has to be decided by some other means.
For example, in the case of the harmonic series

$$\sum_{n=1}^{\infty} \frac{1}{n},$$

which we have seen to be divergent in [119], we have:

$$\frac{u_n}{u_{n-1}} = \frac{n-1}{n} \to 1, \quad \sqrt[n]{u_n} = \sqrt[n]{\frac{1}{n}} = e^{\frac{1}{n} \log \frac{1}{n}} \to 1,\dagger$$

and therefore the question of the convergence or divergence of the harmonic
series cannot be decided by means of Cauchy's or d'Alembert's test.

† It must be noted in the above working that, putting x = 1/n, $x \to 0$
and $(1/n) \log (1/n) = x \log x \to 0$ [66]. Hence, on taking the logarithm of the
expression $\sqrt[n]{1/n}$, we see that it tends to unity.

We show later that the series

$$\frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \ldots + \frac{1}{n^2} + \ldots$$

is convergent.
But we again have in this case:

$$\frac{u_n}{u_{n-1}} = \left( \frac{n-1}{n} \right)^2 \to 1, \quad \sqrt[n]{u_n} = \left( \sqrt[n]{\frac{1}{n}} \right)^2 \to 1,$$

i.e. another doubtful case, if Cauchy's or d'Alembert's test is applied.
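
The doubtful case can be seen numerically: for both the divergent harmonic series and the convergent series of inverse squares, the ratio and the root creep up to 1. This is a minimal sketch; the function and variable names are illustrative assumptions.

```python
def ratio_and_root(term, n):
    """Return (u_n / u_(n-1), u_n ** (1/n)) for a series with positive terms."""
    return term(n) / term(n - 1), term(n) ** (1.0 / n)


harmonic = lambda n: 1.0 / n            # divergent series
inverse_squares = lambda n: 1.0 / n**2  # convergent series

for n in (10, 1000, 100000):
    print(n, ratio_and_root(harmonic, n), ratio_and_root(inverse_squares, n))
```

Neither quantity separates the two series, which is why another test, such as the integral test of the next section, is needed.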


122. Cauchy's integral test for convergence. We assume that the
terms of the series:

$$u_1 + u_2 + u_3 + \ldots + u_n + \ldots \tag{27}$$

are positive and non-increasing, i.e.

$$u_1 \geq u_2 \geq \ldots \geq u_n \geq u_{n+1} \geq \ldots > 0. \tag{28}$$

We represent the terms graphically by plotting n as the independent
variable, which as yet only takes integral values, along the axis of
abscissae, and the corresponding values of $u_n$ on the axis of ordinates
(Fig. 155). A continuous function y = f(x) can always be found, such that
it takes precisely the values $u_n$ for integral values of x = n; in fact,
it is only necessary to draw a continuous curve through all the points
plotted; we shall assume that the function thus obtained is non-increasing.

Fig. 155

With this graphical representation, the sum of the first n terms
of the series,

$$s_n = u_1 + u_2 + \ldots + u_n,$$

is represented by the sum of the areas of the exterior rectangles
in which is contained the area of the figure bounded by the curve
y = f(x), axis OX, and ordinates x = 1 and x = n + 1, so that

$$s_n \geq \int_1^{n+1} f(x)\,dx. \tag{29}$$

On the other hand, this latter figure contains within itself all the
interior rectangles, the sum of the areas of which is:

$$u_2 + u_3 + u_4 + \ldots + u_{n+1} = s_{n+1} - u_1, \tag{30}$$

so that

$$s_{n+1} - u_1 \leq \int_1^{n+1} f(x)\,dx. \tag{31}$$

These inequalities lead us to the following test.
5. Cauchy's integral test. The series (27),

$$u_1 + u_2 + \ldots + u_n + \ldots, \quad u_n = f(n),$$

with positive terms that do not increase with increasing n, converges
or strictly diverges, according as the integral

$$I = \int_1^{\infty} f(x)\,dx \tag{32}$$

is finite or infinite.

We recall that the f(x) here must decrease with increasing x.
Firstly, let integral I be finite, i.e. the curve y = f(x) has a finite
area [98]. It follows from the fact that f(x) is positive, that

$$\int_1^{n+1} f(x)\,dx < \int_1^{\infty} f(x)\,dx,$$

and hence, by (31):

$$s_n < s_{n+1} \leq u_1 + I,$$

i.e. the sum $s_n$ is bounded for all n, and (27) is convergent, on the
basis of test 1 [120].

We now take I = ∞, i.e. the integral

$$\int_1^{n+1} f(x)\,dx$$

can be made greater than any previously assigned number N by
increasing n. By (29), $s_n$ can also be made greater than N, i.e. (27)
is strictly divergent.

It can similarly be shown that the remainder $r_n$ of series (27) does not
exceed the integral

$$\int_n^{\infty} f(x)\,dx.$$

Remark. The integral I in Cauchy's test can be replaced by the integral

$$\int_a^{\infty} f(x)\,dx$$

where a is any positive number, greater than unity.

In fact, if the curve y = f(x) has a finite area on measuring from
the ordinate x = 1, its area will be finite on measuring from any
ordinate x = a, and vice versa. If I = ∞, we speak alternatively of
integral (32) diverging.
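
Inequalities (29) and (31) can be verified numerically for a particular function. The sketch below assumes $f(x) = 1/x^2$, chosen only for illustration because its integral from 1 to n + 1 has the exact value 1 - 1/(n + 1); the function names are likewise assumptions.

```python
def partial_sum(n: int) -> float:
    """s_n = 1/1**2 + 1/2**2 + ... + 1/n**2."""
    return sum(1.0 / k**2 for k in range(1, n + 1))


def integral(n: int) -> float:
    """Exact value of the integral of 1/x**2 from 1 to n + 1."""
    return 1.0 - 1.0 / (n + 1)


for n in (1, 10, 100, 1000):
    lower = integral(n)            # inequality (29): s_n >= integral from 1 to n + 1
    upper = 1.0 + integral(n - 1)  # (31) with n - 1 in place of n: s_n <= u_1 + integral from 1 to n
    print(n, lower, partial_sum(n), upper)
```

The partial sums are squeezed between the two bounds and stay below $u_1 + I = 2$, in agreement with the convergent branch of the test.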


Examples. 1. Consider the harmonic series:

$$\sum_{n=1}^{\infty} \frac{1}{n}.$$

Here we have:

$$f(n) = \frac{1}{n},$$

so that we can put

$$f(x) = \frac{1}{x};$$

then

$$\int_1^{\infty} f(x)\,dx = \int_1^{\infty} \frac{dx}{x} = \log x \,\Big|_1^{\infty},$$

and the integral diverges, since $\log x \to +\infty$ as $x \to +\infty$; thus the series
diverges, as we have already seen.

2. Consider the more general series

$$\sum_{n=1}^{\infty} \frac{1}{n^p}, \tag{33}$$

where p is any number greater than zero (the series obviously diverges for
p ≤ 0). Here we have:

$$f(n) = \frac{1}{n^p}, \quad \int_1^{\infty} f(x)\,dx = \int_1^{\infty} \frac{dx}{x^p} = \begin{cases} \dfrac{x^{1-p}}{1-p} \,\Big|_1^{\infty}, & p \neq 1, \\[2mm] \log x \,\Big|_1^{\infty}, & p = 1. \end{cases}$$

Hence it is clear that the integral diverges, if p ≤ 1, and is convergent
to 1/(p - 1), if p > 1. We have in the latter case the exponent 1 - p < 0,
$x^{1-p} = 1/x^{p-1} \to 0$ as $x \to +\infty$, and thus

$$\frac{x^{1-p}}{1-p} \,\Big|_1^{\infty} = -\frac{1}{1-p} = \frac{1}{p-1}.$$

Hence, by Cauchy's test, series (33) is convergent for p > 1, and divergent
for p ≤ 1.


123. Alternating series. We now turn to series with either positive
or negative terms, and we first take the alternating series, the terms
of which are alternately positive and negative. Such series are more
conveniently written in a new form:

$$u_1 - u_2 + u_3 - u_4 + \ldots \pm u_n \mp u_{n+1} \pm \ldots \tag{34}$$

where the numbers

$$u_1, u_2, u_3, \ldots, u_n, \ldots$$

are reckoned positive.†

The following proposition can be proved regarding alternating
series:

A sufficient condition for the convergence of an alternating series is
that the absolute value of its terms decrease and tend to zero with
increasing n. The absolute value of the remainder of the series does not
exceed the absolute value of the first of the neglected terms.

We first take the sum of an even number of terms of the series:

$$s_{2n} = u_1 - u_2 + u_3 - \ldots + u_{2n-1} - u_{2n}.$$

Since, by hypothesis, the absolute values of the terms decrease
(or it is better to say, do not increase) with increasing n, in general:

$$u_k \geq u_{k+1}, \quad \text{and} \quad u_{2n+1} - u_{2n+2} \geq 0,$$

and hence

$$s_{2n+2} = s_{2n} + (u_{2n+1} - u_{2n+2}) \geq s_{2n},$$

i.e. the variable $s_{2n}$ is non-decreasing. We have on the other hand:

$$s_{2n} = u_1 - (u_2 - u_3) - (u_4 - u_5) - \ldots - (u_{2n-2} - u_{2n-1}) - u_{2n} \leq u_1,$$

since all the differences in brackets are non-negative; i.e. the variable
$s_{2n}$ remains bounded for all n. Hence it follows that, on indefinite
increase of n, $s_{2n}$ tends to a finite limit [30], which we denote by s:

$$\lim_{n \to \infty} s_{2n} = s.$$

We have further:

$$s_{2n+1} = s_{2n} + u_{2n+1} \to s \quad \text{as } n \to \infty,$$

since $u_{2n+1} \to 0$ by hypothesis.

† We assume here that the first term of the series is positive; if it is negative,
the series is written as $-u_1 + u_2 - u_3 + \ldots$

We thus see that the sum of both an even and odd number of
terms of series (34) tends to the same limit s, i.e. (34) is convergent
and has the sum s.

The remainder $r_n$ of the series has still to be estimated. We have:

$$r_n = \pm u_{n+1} \mp u_{n+2} \pm u_{n+3} \mp \ldots,$$

where either the upper or lower signs have to be taken simultaneously.
Alternatively,

$$r_n = \pm (u_{n+1} - u_{n+2} + u_{n+3} - \ldots),$$

whence, by the same argument as before:

$$|r_n| = (u_{n+1} - u_{n+2}) + (u_{n+3} - u_{n+4}) + \ldots = u_{n+1} - (u_{n+2} - u_{n+3}) - \ldots \leq u_{n+1},$$

which it was required to prove.

It follows from the formula:

$$r_n = \pm [(u_{n+1} - u_{n+2}) + (u_{n+3} - u_{n+4}) + \ldots],$$

where the square brackets contain a non-negative quantity, that the
sign of $r_n$ is the same as that to be taken in front of the square brackets,
i.e. is the same as the sign of $\pm u_{n+1}$. Thus, with the hypotheses laid
down in the theorem, the sign of the remainder of an alternating series
is the same as that of the first neglected term.


Example. The absolute values of the terms of the alternating series 


$$1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \ldots$$
decrease indefinitely as $n \to \infty$, so that the series is convergent. We see
later that its sum is log 2. The series is not suitable, however, for the actual
computation of log 2, since we should have to take 10,000 terms to obtain a
remainder less than 0.0001:
$$|r_n| < \frac{1}{n+1} < 0.0001 \quad \text{for } n \geq 10{,}000.$$

This series, though convergent, converges very slowly ; in order to deal 
with this type of series in practice, a preliminary transformation is needed, 
so as to pass from the slowly convergent to a rapidly convergent series, or, 
as we say, to improve the convergence. 
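The estimate $|r_n| < u_{n+1}$ and the rule for the sign of the remainder can be checked numerically on this example. The following is a minimal Python sketch; the function names are ours, chosen for illustration, and the value of log 2 is taken from the standard library rather than derived.

```python
import math

def partial_sum(n):
    """Sum of the first n terms of 1 - 1/2 + 1/3 - 1/4 + ..."""
    return math.fsum((-1) ** (k + 1) / k for k in range(1, n + 1))

def remainder_and_first_neglected(n):
    """Return (r_n, u), where r_n = log 2 - s_n and u is the signed (n+1)-th term."""
    r_n = math.log(2) - partial_sum(n)
    first_neglected = (-1) ** n / (n + 1)  # sign of the (n+1)-th term of the series
    return r_n, first_neglected

for n in (10, 100, 10000):
    r_n, u = remainder_and_first_neglected(n)
    # |r_n| stays below |u| and has the same sign as u
    print(n, r_n, u)
```

Even with 10,000 terms the remainder is still of the order of the fifth decimal place, which is the slow convergence remarked on above.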


124. Absolutely convergent series. As regards other series with terms 
of any sign, we shall restrict our attention to those that are absolutely 
convergent. 

The series 


$$u_1 + u_2 + u_3 + \ldots + u_n + \ldots \qquad (35)$$
is convergent, if the series consisting of the absolute values of its terms
is convergent, i.e. if the series

$$|u_1| + |u_2| + |u_3| + \ldots + |u_n| + \ldots \qquad (36)$$


is convergent. 
A series of this sort is called absolutely convergent. 
Let us suppose that (36) is convergent; we put

$$v_n = \frac{1}{2}(|u_n| + u_n), \qquad w_n = \frac{1}{2}(|u_n| - u_n).$$

Both $v_n$ and $w_n$ are non-negative numbers, since obviously

$$v_n = \begin{cases} u_n, & \text{if } u_n \geq 0, \\ 0, & \text{if } u_n < 0, \end{cases} \qquad w_n = \begin{cases} 0, & \text{if } u_n \geq 0, \\ |u_n|, & \text{if } u_n < 0. \end{cases}$$

On the other hand, neither $v_n$ nor $w_n$ exceeds $|u_n|$, which is the
general term of the convergent series (36), and hence, by the second
test for the convergence of series with positive terms [120], both
series

$$\sum_{n=1}^{\infty} v_n, \qquad \sum_{n=1}^{\infty} w_n$$

will be convergent.

Since we have:

$$u_n = v_n - w_n,$$

the series
$$\sum_{n=1}^{\infty} u_n = \sum_{n=1}^{\infty} (v_n - w_n) = \sum_{n=1}^{\infty} v_n - \sum_{n=1}^{\infty} w_n,$$

got by term by term subtraction of $\sum_{n=1}^{\infty} w_n$ from $\sum_{n=1}^{\infty} v_n$, is convergent [119].
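The splitting of a term into its positive part $v_n$ and negative part $w_n$ used in this proof can be tried out on a concrete series. The sketch below assumes, purely for illustration, the absolutely convergent series with general term $(-1)^{n+1}/n^2$; the helper name `split_terms` is ours.

```python
def split_terms(terms):
    """Return the lists v_n = (|u_n| + u_n)/2 and w_n = (|u_n| - u_n)/2."""
    v = [(abs(u) + u) / 2 for u in terms]
    w = [(abs(u) - u) / 2 for u in terms]
    return v, w

# Illustrative series (an assumption of this sketch): u_n = (-1)^(n+1) / n^2
terms = [(-1) ** (n + 1) / n ** 2 for n in range(1, 2001)]
v, w = split_terms(terms)

# Both parts are non-negative, neither exceeds |u_n|, and u_n = v_n - w_n
print(sum(terms), sum(v) - sum(w))
```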

Convergent series with positive terms represent a particular case 
of absolutely convergent series. Convergence tests for these latter 
are obtained directly from the tests for series with positive terms. 

Convergence tests 1-5, deduced in [120, 121, 122] for series with
positive terms, can be used for series with terms of any sign, provided
only that we agree to replace $u_n$ everywhere by $|u_n|$. With this proviso,
divergence tests 3 and 4 remain valid, together with their corollary [121].

In particular, in formulating Cauchy’s and d’Alembert’s tests, we
have to replace:

$$\sqrt[n]{u_n} \ \text{and} \ \frac{u_n}{u_{n-1}} \quad \text{by} \quad \sqrt[n]{|u_n|} \ \text{and} \ \left|\frac{u_n}{u_{n-1}}\right|.$$

If, for instance, $|u_n/u_{n-1}| \leq q < 1$, i.e. $|u_n|/|u_{n-1}| \leq q < 1$,
series (36) with positive terms is convergent by d’Alembert’s test
[121], and series (35) is therefore absolutely convergent. On the other
hand, if $|u_n/u_{n-1}| \geq 1$, i.e. $|u_n| \geq |u_{n-1}|$, the term $u_n$ cannot decrease
in absolute value with increasing n, and therefore cannot tend to
zero, and series (35) is divergent. Hence it follows, as in the corollary
of [121], that if $|u_n/u_{n-1}| \to r < 1$, series (35) is absolutely convergent;
whilst if $|u_n/u_{n-1}| \to r > 1$, series (35) is divergent.

Remark. We note further that, if the absolute values of the terms of
a series (35) are not greater than the positive numbers $a_n$, $|u_n| \leq a_n$,
the series of these numbers being convergent, i.e. $a_1 + a_2 + \ldots + a_n + \ldots$
is convergent, series (36) is then certainly convergent
[120], i.e. series (35) is absolutely convergent.


Examples. 1. The series (Example [121])
$$\sum_{n=1}^{\infty} \frac{x^n}{n!}$$
is absolutely convergent for all finite x, either positive or negative, since
$$\left|\frac{u_n}{u_{n-1}}\right| = \frac{|x|}{n} \to 0$$
for all finite x.
2. The series
$$\sum_{n=1}^{\infty} \frac{x^n}{n}$$
is absolutely convergent for |x| < 1, and divergent for |x| > 1, since
$$\left|\frac{u_n}{u_{n-1}}\right| = \frac{n-1}{n}|x| \to |x|.$$
3. The series
$$\sum_{n=1}^{\infty} r^n \sin n\alpha$$
is absolutely convergent for |r| < 1, since then:
$$\sqrt[n]{|u_n|} = |r| \sqrt[n]{|\sin n\alpha|} \leq |r| < 1.$$
It must be noted that by no means every convergent series is also absolutely
convergent, i.e. remains convergent after replacing each term of the series
by its absolute value. For instance, the alternating series

$$1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \ldots$$
is convergent, as we have seen; whereas on replacing each term by its absolute
value, we get the divergent harmonic series:

$$1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \ldots$$

Absolutely convergent series have a number of interesting properties,
discussed in the small print section § 14. For example, they
alone have a property of finite sums, namely independence of the
sum of the order of the terms.


125. General test for convergence. We conclude the present section 
with some remarks on the necessary and sufficient condition for con- 
vergence of the series 


$$u_1 + u_2 + u_3 + \ldots + u_n + \ldots$$

Its convergence is, by definition, equivalent to the existence of a
limit for the sequence:

$$s_1, \ s_2, \ \ldots, \ s_n, \ \ldots,$$


where $s_n$ is the sum of the first n terms of the series. But we have
Cauchy's necessary and sufficient condition for the existence of this
limit [31]:

For any given positive ε there exists an N, such that

$$|s_m - s_n| < \varepsilon$$

for all m and n > N. We take m > n for clarity, and let m = n + p,
where p is any positive integer. Noting that now

$$s_m - s_n = s_{n+p} - s_n = (u_1 + u_2 + \ldots + u_n + u_{n+1} + \ldots + u_{n+p}) - (u_1 + u_2 + \ldots + u_n) = u_{n+1} + u_{n+2} + \ldots + u_{n+p},$$


we can put forward the following general test for the convergence of 
a series. 

A necessary and sufficient condition for the convergence of the infinite 
series 


$$u_1 + u_2 + u_3 + \ldots + u_n + \ldots$$

is that for any previously assigned positive ε there exists an N, such
that, for any n > N and for any positive p,

$$|u_{n+1} + u_{n+2} + \ldots + u_{n+p}| < \varepsilon,$$
i.e. starting with $u_{n+1}$, the absolute value of the sum of any number of
successive terms of the series remains less than ε as soon as n > N.

It must be remarked that this general test for the convergence of 
a series, though of great theoretical importance, is usually difficult 
to apply in practice. 
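The meaning of the general test can nevertheless be seen on two simple cases. The Python sketch below is an illustration under our own choice of examples: for the harmonic series the block $u_{n+1} + \ldots + u_{2n}$ never falls below 1/2, so the condition fails, whereas for the series with general term $1/n^2$ every block following $u_n$ is less than 1/n. The function name `block_sum` is hypothetical.

```python
def block_sum(term, n, p):
    """|u_{n+1} + u_{n+2} + ... + u_{n+p}| for the series with general term term(k)."""
    return abs(sum(term(k) for k in range(n + 1, n + p + 1)))

def harmonic(k):
    return 1.0 / k

def inverse_square(k):
    return 1.0 / k ** 2

for n in (10, 100, 1000):
    # Harmonic series: the block of n terms after u_n is never small.
    # Series 1/n^2: even a long block after u_n is below 1/n.
    print(n, block_sum(harmonic, n, n), block_sum(inverse_square, n, 5000))
```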


§ 13. Taylor's formula and its applications


126. Taylor's formula. We take a polynomial of degree n: 
$$f(x) = a_0 + a_1 x + a_2 x^2 + \ldots + a_n x^n;$$

we give x an increment h and find the corresponding value of the
function f(x + h). This value can evidently be expressed in powers
of h, by expanding the various powers of (x + h) by means of Newton's
binomial formula and re-arranging the result in powers of h. The
coefficients of these powers of h will be polynomials in x:

$$f(x+h) = A_0(x) + hA_1(x) + h^2A_2(x) + \ldots + h^kA_k(x) + \ldots + h^nA_n(x), \qquad (1)$$
and we have to determine:
$$A_0(x), \ A_1(x), \ \ldots, \ A_n(x).$$

We first change the notation, writing a instead of x, and x instead
of (x + h), in the identity (1). We now have
$$h = x - a,$$
and (1) becomes:

$$f(x) = A_0(a) + (x-a)A_1(a) + (x-a)^2A_2(a) + \ldots + (x-a)^kA_k(a) + \ldots + (x-a)^nA_n(a). \qquad (2)$$
We find $A_0(a)$ by putting x = a in this identity, which gives:
$$f(a) = A_0(a).$$


We find $A_1(a)$ by differentiating (2) with respect to x then putting
x = a:
$$f'(x) = 1 \cdot A_1(a) + 2(x-a)A_2(a) + \ldots + k(x-a)^{k-1}A_k(a) + \ldots + n(x-a)^{n-1}A_n(a),$$
$$f'(a) = 1 \cdot A_1(a).$$
We differentiate a second time with respect to x, then put x = a,
and find $A_2(a)$:

$$f''(x) = 2 \cdot 1 \cdot A_2(a) + \ldots + k(k-1)(x-a)^{k-2}A_k(a) + \ldots + n(n-1)(x-a)^{n-2}A_n(a),$$
$$f''(a) = 2 \cdot 1 \cdot A_2(a).$$
Continuing these operations, we differentiate k times with respect
to x then put x = a, and get:
$$f^{(k)}(x) = k(k-1)\ldots 2 \cdot 1 \cdot A_k(a) + \ldots + n(n-1)\ldots(n-k+1)(x-a)^{n-k}A_n(a),$$
$$f^{(k)}(a) = k!\,A_k(a).$$
We thus have: 


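$$A_0(a) = f(a), \quad A_1(a) = \frac{f'(a)}{1!}, \quad A_2(a) = \frac{f''(a)}{2!}, \quad \ldots, \quad A_k(a) = \frac{f^{(k)}(a)}{k!}, \quad \ldots, \quad A_n(a) = \frac{f^{(n)}(a)}{n!},$$

so that (2) now reads:

$$f(x) = f(a) + \frac{f'(a)}{1!}(x-a) + \frac{f''(a)}{2!}(x-a)^2 + \ldots + \frac{f^{(k)}(a)}{k!}(x-a)^k + \ldots + \frac{f^{(n)}(a)}{n!}(x-a)^n. \qquad (3)$$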


This formula is only true when f(x) is a polynomial of degree not
greater than n, in which case it gives an expression for the polynomial
in powers of the difference $(x - a)$. Let f(x) be any given function,
allowing for differentiation up to and including the nth order.
Let $R_n(x)$ be the error obtained by taking the right-hand side of (3)
for f(x), i.e. we have:

$$f(x) = f(a) + \frac{f'(a)}{1!}(x-a) + \frac{f''(a)}{2!}(x-a)^2 + \ldots + \frac{f^{(n)}(a)}{n!}(x-a)^n + R_n(x). \qquad (4)$$


We assume that f(x) has a continuous derivative of order (n + 1)
in some region of variation of x including x = a, and we find $R_n(x)$
in terms of this derivative. Differentiating identity (4) once, twice,
..., n times, we have:

$$\left.\begin{aligned}
f'(x) &= f'(a) + \frac{f''(a)}{1!}(x-a) + \ldots + \frac{f^{(n)}(a)}{(n-1)!}(x-a)^{n-1} + R_n'(x), \\
f''(x) &= f''(a) + \frac{f'''(a)}{1!}(x-a) + \ldots + \frac{f^{(n)}(a)}{(n-2)!}(x-a)^{n-2} + R_n''(x), \\
&\ \ \vdots \\
f^{(n)}(x) &= f^{(n)}(a) + R_n^{(n)}(x).
\end{aligned}\right\} \qquad (4_1)$$
Putting x = a in (4) and these last equations, we find:
$$R_n(a) = 0, \quad R_n'(a) = 0, \quad \ldots, \quad R_n^{(n)}(a) = 0. \qquad (5)$$
We find on again differentiating the last of equations $(4_1)$:
$$R_n^{(n+1)}(x) = f^{(n+1)}(x). \qquad (6)$$


We obtain an expression for $R_n(x)$ without difficulty from (5) and (6),
since by a basic formula of the integral calculus:

$$\begin{aligned}
R_n(x) &= \int_a^x R_n'(t)\,dt = -\int_a^x R_n'(t)\,d(x-t) \\
&= -R_n'(t)(x-t)\Big|_a^x + \int_a^x R_n''(t)(x-t)\,dt = -\int_a^x R_n''(t)\,d\frac{(x-t)^2}{2!} \\
&= -R_n''(t)\frac{(x-t)^2}{2!}\Big|_a^x + \int_a^x R_n'''(t)\frac{(x-t)^2}{2!}\,dt = -\int_a^x R_n'''(t)\,d\frac{(x-t)^3}{3!} \\
&= -R_n'''(t)\frac{(x-t)^3}{3!}\Big|_a^x + \int_a^x R_n^{(4)}(t)\frac{(x-t)^3}{3!}\,dt = \ldots \\
&= \int_a^x R_n^{(n+1)}(t)\frac{(x-t)^n}{n!}\,dt = \frac{1}{n!}\int_a^x f^{(n+1)}(t)(x-t)^n\,dt.
\end{aligned}$$

The above working is explained as follows. The variable of integration
is denoted by t, so that x is to be taken as a constant under the
integral sign, and the differential of x as zero; and hence, for example,

$$d(x-t) = -dt, \quad d\frac{(x-t)^2}{2!} = -(x-t)\,dt, \quad d\frac{(x-t)^3}{3!} = -\frac{(x-t)^2}{2!}\,dt,$$

and in general,

$$d\frac{(x-t)^k}{k!} = -\frac{(x-t)^{k-1}}{(k-1)!}\,dt.$$

The expression

$$R_n^{(k)}(t)\frac{(x-t)^k}{k!}\Big|_a^x$$
is zero because the factor $(x-t)^k$ vanishes on substituting t = x
whilst the factor $R_n^{(k)}(a) = 0$ by (5) on substituting t = a.
We obtain in this way the following important proposition: 
Taylor's formula. Every function f(x), having continuous derivatives
up to and including order (n + 1) in some interval including x = a
as an interior point, can be expressed in powers of $(x - a)$ for all x inside
the interval, as

$$f(x) = f(a) + (x-a)\frac{f'(a)}{1!} + (x-a)^2\frac{f''(a)}{2!} + \ldots + (x-a)^n\frac{f^{(n)}(a)}{n!} + R_n(x), \qquad (7)$$

where the remainder term $R_n(x)$ has the form:
$$R_n(x) = \frac{1}{n!}\int_a^x f^{(n+1)}(t)(x-t)^n\,dt. \qquad (8)$$
a 


A second form of the remainder term is often encountered in applications,
this being derived at once from (8) by using the mean value theorem
of [95]. The function $(x - t)^n$ under the integral on the right-hand
side of (8) preserves its sign, so that we have by the mean value theorem:

$$R_n(x) = \frac{f^{(n+1)}(\xi)}{n!}\int_a^x (x-t)^n\,dt = \frac{f^{(n+1)}(\xi)}{n!}\left[-\frac{(x-t)^{n+1}}{n+1}\right]_a^x.$$

We obtain on substituting the upper and lower limits:

$$\left[-\frac{(x-t)^{n+1}}{n+1}\right]_a^x = \frac{(x-a)^{n+1}}{n+1},$$
since the expression written vanishes for t = x. Substituting this in
the above formula, we have:
$$R_n(x) = (x-a)^{n+1}\frac{f^{(n+1)}(\xi)}{(n+1)!}, \qquad (9)$$
where ξ is a mean value lying between a and x. This result is known
as Lagrange's form of the remainder term, and Taylor's formula with
Lagrange's remainder reads:

$$f(x) = f(a) + (x-a)\frac{f'(a)}{1!} + (x-a)^2\frac{f''(a)}{2!} + \ldots + (x-a)^n\frac{f^{(n)}(a)}{n!} + (x-a)^{n+1}\frac{f^{(n+1)}(\xi)}{(n+1)!} \qquad (7_1)$$

(ξ between a and x).
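For a polynomial of degree n the derivative of order (n + 1) vanishes identically, so that the remainder is zero and the formula reduces to the exact re-expansion (3) in powers of $(x - a)$. The following Python sketch, given only as an illustration, computes the coefficients $A_k(a) = f^{(k)}(a)/k!$ by repeated differentiation; the helper names and the sample polynomial $x^3 - 2x + 5$ are our own assumptions.

```python
from fractions import Fraction
from math import factorial

def derivative(coeffs):
    """Coefficients of the derivative; coeffs[k] multiplies x**k."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def evaluate(coeffs, x):
    """Horner evaluation of the polynomial at x."""
    result = 0
    for c in reversed(coeffs):
        result = result * x + c
    return result

def taylor_coefficients(coeffs, a):
    """A_k(a) = f^(k)(a) / k! for k = 0, ..., n, as in formula (3)."""
    out, current = [], list(coeffs)
    for k in range(len(coeffs)):
        out.append(Fraction(evaluate(current, a), factorial(k)))
        current = derivative(current)
    return out

# Sample polynomial (an assumption of this sketch): f(x) = 5 - 2x + x^3, about a = 2
A = taylor_coefficients([5, -2, 0, 1], 2)
print(A)  # coefficients of 1, (x - 2), (x - 2)^2, (x - 2)^3
```

Substituting any value of x into the re-expanded form and into the original polynomial gives the same result, which is a convenient check on the coefficients.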


127. Different forms of Taylor's formula. Putting n = 0 in $(7_1)$ gives
us Lagrange's formula for finite increments, deduced earlier in [63]:

$$f(x) - f(a) = (x - a)f'(\xi);$$


Taylor's formula is thus seen as a direct generalization of the formula 
for finite increments. 

If we return to our previous notation and write x instead of a and
x + h instead of x, Taylor's formula (7) becomes:

$$f(x+h) - f(x) = \frac{hf'(x)}{1!} + \frac{h^2f''(x)}{2!} + \ldots + \frac{h^nf^{(n)}(x)}{n!} + R_n, \qquad (10)$$

since $(x - a)$ is to be replaced by h in the new notation. The value ξ,
lying between a and x in the previous notation, will lie between x
and (x + h), and can be written as (x + θh), where 0 < θ < 1.
Using (9), the remainder term of formula (10) can thus be written
in the form:
$$R_n = \frac{h^{n+1}}{(n+1)!}f^{(n+1)}(x + \theta h) \qquad (0 < \theta < 1). \qquad (11)$$
The left-hand side of (10) is the increment Δy of the function
y = f(x), corresponding to the increment, or what amounts to the
same thing, to the differential h of the independent variable. Recalling
the expressions for higher order differentials [55], we have:

$$dy = y'\,dx = f'(x)h, \quad d^2y = y''(dx)^2 = f''(x)h^2, \quad \ldots, \quad d^ny = y^{(n)}(dx)^n = f^{(n)}(x)h^n,$$
whence
$$\Delta y = \frac{dy}{1!} + \frac{d^2y}{2!} + \ldots + \frac{d^ny}{n!} + \left.\frac{d^{n+1}y}{(n+1)!}\right|_{x+\theta h}, \qquad (12)$$
the symbol:
$$\left.\frac{d^{n+1}y}{(n+1)!}\right|_{x+\theta h}$$

denoting the result of substituting x + θh instead of x in the expression:

$$\frac{d^{n+1}y}{(n+1)!}.$$

Taylor's formula is of particular interest in this form when the
increment h of the independent variable is an infinitesimal. The infinitesimal
terms of different orders with respect to h in the increment
Δy of the function can then be separated out by using (12).

The initial value a of the independent variable is commonly zero,
in which case Taylor's formula (7) takes the form:

$$f(x) = f(0) + x\frac{f'(0)}{1!} + x^2\frac{f''(0)}{2!} + \ldots + x^n\frac{f^{(n)}(0)}{n!} + R_n(x), \qquad (13)$$

where

$$R_n(x) = \frac{1}{n!}\int_0^x f^{(n+1)}(t)(x-t)^n\,dt = \frac{x^{n+1}}{(n+1)!}f^{(n+1)}(\xi) = \frac{x^{n+1}}{(n+1)!}f^{(n+1)}(\theta x), \qquad (14)$$
and ξ, lying between 0 and x, can be written: ξ = θx, where θ is a
number satisfying 0 < θ < 1. Formula (13) is called Maclaurin's
formula.


128. Taylor and Maclaurin series. If the given function f(x) has
derivatives of all orders, Taylor’s and Maclaurin’s formulae can be
written for any n. We re-write (7) in the form:

$$f(x) - \left[f(a) + (x-a)\frac{f'(a)}{1!} + (x-a)^2\frac{f''(a)}{2!} + \ldots + (x-a)^n\frac{f^{(n)}(a)}{n!}\right] = f(x) - S_{n+1} = R_n(x),$$
where $S_{n+1}$ is the sum of the first (n + 1) terms of the infinite series

$$f(a) + (x-a)\frac{f'(a)}{1!} + (x-a)^2\frac{f''(a)}{2!} + \ldots + (x-a)^n\frac{f^{(n)}(a)}{n!} + (x-a)^{n+1}\frac{f^{(n+1)}(a)}{(n+1)!} + \ldots$$
If on indefinite increase of n:

$$\lim_{n \to \infty} R_n(x) = 0, \qquad (15)$$
the series just written converges, by what was said in [118], and f(x)
is seen to be equal to the sum S of this series. We thus obtain an expansion
of the function f(x) into an infinite Taylor power series

$$f(x) = f(a) + (x-a)\frac{f'(a)}{1!} + (x-a)^2\frac{f''(a)}{2!} + \ldots \qquad (16)$$

in powers of the difference $(x - a)$.
Maclaurin's formula similarly gives us, when condition (15) is
satisfied:

$$f(x) = f(0) + x\frac{f'(0)}{1!} + x^2\frac{f''(0)}{2!} + \ldots \qquad (17)$$


By evaluating the remainder term $R_n$ as a function of n, we get the
error involved in taking the sum of the first (n + 1) terms of the series
as f(x) instead of the complete series. This is of great importance in
the approximate evaluation of a function by means of its expansion
into a power series, which is the method most commonly used in
practice.

We apply the above working to the expansion and approximate 
evaluation of some simple functions. 


129. Expansion of $e^x$. We have in the first place:

$$f(x) = e^x, \quad f'(x) = e^x, \quad \ldots, \quad f^{(k)}(x) = e^x, \quad \ldots,$$

so that

$$f(0) = f'(0) = \ldots = f^{(k)}(0) = 1,$$

and Maclaurin’s formula with remainder term (14) gives:
$$e^x = 1 + \frac{x}{1!} + \frac{x^2}{2!} + \ldots + \frac{x^n}{n!} + \frac{x^{n+1}}{(n+1)!}e^{\theta x} \qquad (0 < \theta < 1).$$

We have seen (Example [121]) that the series

$$\sum_{n=1}^{\infty}\frac{x^n}{n!}$$
is absolutely convergent for all finite x, and hence we have for all x:

$$\frac{x^{n+1}}{(n+1)!} \to 0 \quad \text{as } n \to \infty,$$
since this expression is the general term of a convergent series.†
On the other hand, the factor $e^{\theta x}$ in the expression for the remainder
certainly does not exceed $e^x$ for x > 0 and unity for x < 0, and the
remainder therefore tends to zero for all x. This gives us the expansion:

$$e^x = 1 + \frac{x}{1!} + \frac{x^2}{2!} + \ldots + \frac{x^n}{n!} + \ldots, \qquad (18)$$

which is valid for all x.
In particular, putting x = 1, we get an expression for e which is
very convenient for evaluating e to any degree of accuracy:

$$e = 1 + \frac{1}{1!} + \frac{1}{2!} + \ldots + \frac{1}{n!} + \ldots$$


We use this formula to evaluate e to six places of decimals. If we take the
approximation:

$$e \approx 2 + \frac{1}{2!} + \frac{1}{3!} + \ldots + \frac{1}{n!},$$
the error will be:

$$\frac{1}{(n+1)!} + \frac{1}{(n+2)!} + \ldots = \frac{1}{(n+1)!}\left[1 + \frac{1}{n+2} + \frac{1}{(n+2)(n+3)} + \ldots\right] < \frac{1}{(n+1)!}\left[1 + \frac{1}{n+1} + \frac{1}{(n+1)^2} + \ldots\right] = \frac{1}{(n+1)!} \cdot \frac{1}{1 - \frac{1}{n+1}} = \frac{1}{n!\,n},$$

where the (<) sign comes from replacing the factors (n + 2), (n + 3),
(n + 4), ... by the smaller numbers (n + 1) in the denominator of the fraction,
so that the fraction is increased.

We can thus say that e is included between the following limits:

$$2 + \frac{1}{2!} + \ldots + \frac{1}{n!} < e < 2 + \frac{1}{2!} + \ldots + \frac{1}{n!} + \frac{1}{n!\,n}.$$

If we want to find an approximate value for e, differing from the true value
by not more than 0.000001, we put n = 10; then

$$e \approx 2 + \frac{1}{2!} + \frac{1}{3!} + \ldots + \frac{1}{10!},$$

and the error does not exceed $1/(10! \cdot 10) < 3 \times 10^{-8}$. The first two terms in
this formula are calculated accurately; the remaining eight terms must be
calculated to seven places, since the error of each term is then not greater
than 0.5 of unity at the seventh place, i.e. $0.5 \times 10^{-7}$, and the total error is
not greater than:
$$10^{-7} \times 0.5 \times 8 = 4 \times 10^{-7},$$

i.e. four times unity at the seventh place. Hence, the absolute value of the
error as a whole does not exceed $4.3 \times 10^{-7}$.


† Cf. also the example in [34].







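We have:
$$\begin{array}{rcll}
2 & = & 2.0000000 & \text{(accurate)} \\
\frac{1}{2!} & = & 0.5000000 & \text{(accurate)} \\
\frac{1}{3!} = \frac{1}{2! \cdot 3} & = & 0.1666667 & \text{(high value)} \\
\frac{1}{4!} = \frac{1}{3! \cdot 4} & = & 0.0416667 & \text{(high value)} \\
\frac{1}{5!} = \frac{1}{4! \cdot 5} & = & 0.0083333 & \text{(low value)} \\
\frac{1}{6!} = \frac{1}{5! \cdot 6} & = & 0.0013889 & \text{(high value)} \\
\frac{1}{7!} = \frac{1}{6! \cdot 7} & = & 0.0001984 & \text{(low value)} \\
\frac{1}{8!} = \frac{1}{7! \cdot 8} & = & 0.0000248 & \text{(low value)} \\
\frac{1}{9!} = \frac{1}{8! \cdot 9} & = & 0.0000028 & \text{(high value)} \\
\frac{1}{10!} = \frac{1}{9! \cdot 10} & = & \ldots & \\
\hline
e & \approx & 2.7182818 &
\end{array}$$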

The value of e to 12 places is 2.718281828459. 
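The two-sided estimate for e obtained above lends itself to a direct check. The sketch below is an illustration only: it uses exact rational arithmetic so that no rounding enters, and the function name `e_bounds` is ours.

```python
import math
from fractions import Fraction

def e_bounds(n):
    """Lower and upper limits for e from the first terms up to 1/n!.

    lower = 2 + 1/2! + ... + 1/n!,  upper = lower + 1/(n! * n).
    """
    lower = Fraction(2)
    term = Fraction(1)          # holds 1/k! as k runs from 2 to n
    for k in range(2, n + 1):
        term /= k
        lower += term
    upper = lower + term / n    # term is now 1/n!
    return lower, upper

lower, upper = e_bounds(10)
print(float(lower), float(upper), float(upper - lower))
```

For n = 10 the gap between the two limits is $1/(10! \cdot 10)$, in agreement with the bound $3 \times 10^{-8}$ quoted in the text.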


130. Expansion of sin x and cos x. We have [53]:
$$f(x) = \sin x, \quad f'(x) = \sin\left(x + \frac{\pi}{2}\right), \quad \ldots, \quad f^{(k)}(x) = \sin\left(x + k\frac{\pi}{2}\right),$$
whence
$$f(0) = 0, \quad f'(0) = 1, \quad f''(0) = 0, \quad f'''(0) = -1, \quad \ldots,$$
$$f^{(2m)}(0) = 0, \quad f^{(2m+1)}(0) = (-1)^m,$$
so that (13) now gives:

$$\sin x = \frac{x}{1!} - \frac{x^3}{3!} + \frac{x^5}{5!} - \ldots + (-1)^n\frac{x^{2n+1}}{(2n+1)!} + \frac{x^{2n+3}}{(2n+3)!}\sin\left[\theta x + (2n+3)\frac{\pi}{2}\right].$$

The factor $x^{2n+3}/(2n+3)!$ tends to zero for n → ∞, as we have seen
above, whilst the absolute value of a sine does not exceed unity, so
that the remainder tends to zero for all finite x, i.e. the expansion

$$\sin x = \frac{x}{1!} - \frac{x^3}{3!} + \frac{x^5}{5!} - \ldots + (-1)^n\frac{x^{2n+1}}{(2n+1)!} + \ldots \qquad (19)$$

is valid for all x.
We can show similarly that the expansion

$$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \ldots + (-1)^n\frac{x^{2n}}{(2n)!} + \ldots \qquad (20)$$

is valid for all values of x.

Series (19) and (20) are very convenient for calculating the values
of sin x and cos x for small angles x. They are alternating for all
positive or negative x, so that if we take such a number of
terms that the later terms are decreasing, the absolute
value of the error does not exceed the first of the neglected
terms [123].

Series (19) and (20) are only slowly convergent for large
values of x, and are unsuitable for calculations. Figure 156 shows the relative disposition
of the accurate curve of sin x and of the first three approximations:

$$x, \qquad x - \frac{x^3}{6}, \qquad x - \frac{x^3}{6} + \frac{x^5}{120}.$$

[Fig. 156. The curve y = sin x and its first three approximations.]

The more terms are taken in the approximation, the larger the interval
in which the approximate is close to the accurate curve. We
remark that angle x is expressed in circular measure, i.e. in
radians [33], in all the above formulae.


Example. To evaluate sin 10° to an accuracy of $10^{-5}$. We first convert
degrees to radians:

$$\text{arc } 10° = \frac{2\pi}{360} \times 10 = \frac{\pi}{18} = 0.17\ldots$$

If we take the approximation

$$\sin\frac{\pi}{18} \approx \frac{\pi}{18} - \frac{1}{6}\left(\frac{\pi}{18}\right)^3,$$
our error does not exceed

$$\frac{1}{120} \times (0.2)^5 < 4 \times 10^{-6} \qquad \left(\frac{\pi}{18} < 0.2\right).$$

Each term on the right-hand side of the above formula for sin π/18 has
to be evaluated to six places, since the total error will then not exceed

$$2 \times 0.5 \times 10^{-6} + 4 \times 10^{-6} = 5 \times 10^{-6}.$$
We have with the accuracy mentioned:

$$\frac{\pi}{18} = 0.174533; \quad \frac{1}{6}\left(\frac{\pi}{18}\right)^3 = 0.000886; \quad \sin\frac{\pi}{18} = 0.173647,$$

where the first four places can be guaranteed.
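The arithmetic of this example is easily repeated by machine. The following Python sketch is a plain check under the same two-term approximation; the variable names are ours, and the library sine is used only for comparison.

```python
import math

x = math.pi / 18                 # 10 degrees in radians
approx = x - x ** 3 / 6          # first two terms of series (19)
bound = 0.2 ** 5 / 120           # first neglected term with x replaced by 0.2

print(round(x, 6), round(x ** 3 / 6, 6), round(approx, 6))
print(abs(math.sin(x) - approx), bound)
```

The actual error is about half the bound, as is to be expected since π/18 is replaced by the larger number 0.2 in the estimate.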


131. Newton's binomial expansion. Here we have, taking x > -1,
i.e. 1 + x > 0:

$$f(x) = (1+x)^m, \quad f'(x) = m(1+x)^{m-1}, \quad \ldots,$$
$$f^{(k)}(x) = m(m-1)\ldots(m-k+1)(1+x)^{m-k},$$
$$f(0) = 1, \quad f'(0) = m, \quad \ldots, \quad f^{(k)}(0) = m(m-1)\ldots(m-k+1),$$
where m is any real number; so that formula (13) gives us:

$$(1+x)^m = 1 + \frac{m}{1!}x + \frac{m(m-1)}{2!}x^2 + \ldots + \frac{m(m-1)\ldots(m-n+1)}{n!}x^n + R_n(x), \qquad (21)$$

where the remainder term can be found from (8) with a = 0:

$$R_n(x) = \frac{1}{n!}\int_0^x f^{(n+1)}(t)(x-t)^n\,dt.$$
Noting that here:

$$f^{(n+1)}(t) = m(m-1)\ldots(m-n)(1+t)^{m-n-1},$$

we can write:

$$R_n(x) = \frac{m(m-1)\ldots(m-n)}{n!}\int_0^x (x-t)^n(1+t)^{m-n-1}\,dt. \qquad (22)$$


ni 





We apply to the integral the mean value theorem (13) of [95], letting
θx, where 0 < θ < 1, denote the value of t, lying between 0 and x,
that appears in this theorem; we get:

$$R_n(x) = \frac{m(m-1)\ldots(m-n)}{n!}(x - \theta x)^n(1+\theta x)^{m-n-1}\int_0^x dt = \frac{(m-1)(m-2)\ldots(m-n)}{n!}x^n\left(\frac{1-\theta}{1+\theta x}\right)^n(1+\theta x)^{m-1}mx. \qquad (23)$$


n! 
If $R_n(x) \to 0$, the series

$$1 + \frac{m}{1!}x + \frac{m(m-1)}{2!}x^2 + \ldots + \frac{m(m-1)\ldots(m-n+1)}{n!}x^n + \ldots \qquad (24)$$
must be convergent [118]. We have:

$$\left|\frac{u_{n+1}}{u_n}\right| = \left|\frac{m-n}{n+1}\right||x| \to |x| \quad \text{for } n \to \infty,$$

so that the series converges (absolutely) for |x| < 1 and diverges
for |x| > 1 [124]. Although (24) converges for |x| < 1, it is still
not clear that its sum is then $(1+x)^m$, and it has to be shown that
$R_n(x) \to 0$ for |x| < 1. The factor

$$\frac{(m-1)(m-2)\ldots(m-n)}{n!}x^n$$

in expression (23) for $R_n(x)$ will be the general term of a convergent
series (24) in which m has been replaced by (m - 1), and therefore
[118] it tends to zero for n → ∞.

The factor $[(1-\theta)/(1+\theta x)]^n$ does not exceed unity for any n.
For we have here -1 < x < +1, so that 0 < 1 - θ < 1 + θx for
all positive or negative x, whence:

$$0 < \frac{1-\theta}{1+\theta x} < 1 \quad \text{and} \quad 0 < \left(\frac{1-\theta}{1+\theta x}\right)^n < 1.$$

The last factor $mx(1+\theta x)^{m-1}$ also remains bounded, since (1 + θx)
lies between 1 and 1 + x, and $mx(1+\theta x)^{m-1}$ lies between mx and
$mx(1+x)^{m-1}$, these limits being independent of n.

It is clear from these remarks that (23) gives $R_n(x)$ as the product
of three factors, one of which tends to zero, whilst the other two are
bounded on indefinite increase of n; and thus:

$$R_n(x) \to 0 \quad \text{for } n \to \infty.$$


Thus the expansion

$$(1+x)^m = 1 + \frac{m}{1!}x + \frac{m(m-1)}{2!}x^2 + \ldots + \frac{m(m-1)\ldots(m-n+1)}{n!}x^n + \ldots \qquad (25)$$

is valid for all values of x satisfying:
$$-1 < x < +1.$$


When the exponent m is a positive integer, series (25) finishes at
the term n = m and reduces to Newton’s elementary binomial formula.
In the general case, the expansion (25) is a generalization of
Newton’s binomial theorem for any exponent m.

It is useful to note some special cases of the binomial expansion:

$$\frac{1}{1+x} = 1 - x + x^2 - x^3 + \ldots, \qquad (26)$$
$$\sqrt{1+x} = 1 + \frac{1}{2}x - \frac{1}{2 \cdot 4}x^2 + \frac{1 \cdot 3}{2 \cdot 4 \cdot 6}x^3 - \frac{1 \cdot 3 \cdot 5}{2 \cdot 4 \cdot 6 \cdot 8}x^4 + \ldots \qquad (27)$$

We remark that $(1+x)^m$ has a positive value for every x > -1
[19, 44], i.e. the sum of series (24) is positive for -1 < x < +1.
In particular, for instance, series (27) gives the positive value of
$\sqrt{1+x}$ in this interval.


Examples. 1. Extracting roots. Formula (25) is specially suitable for extracting
roots to any degree of accuracy. Let it be required to find the mth root
of an integer A. An integer a can always be chosen so that its mth power
is as close as possible to A; we put $A = a^m + b$, where $|b| < a^m$, which
gives:

$$\sqrt[m]{A} = \sqrt[m]{a^m + b} = a\sqrt[m]{1 + \frac{b}{a^m}}.$$

Since here $|b/a^m| < 1$, then letting x denote $b/a^m$, we can calculate $\sqrt[m]{1 + b/a^m}$
by Newton’s binomial formula, the series being the better convergent, the
smaller the absolute value of the ratio x.

We calculate, for instance, $\sqrt[5]{1000}$ to an accuracy of $10^{-5}$. We have:

$$\sqrt[5]{1000} = \sqrt[5]{1024 - 24} = 4\left(1 - \frac{3}{128}\right)^{1/5} = 4\left[1 - \frac{1}{5} \cdot \frac{3}{128} - \frac{1}{5} \cdot \frac{4}{10}\left(\frac{3}{128}\right)^2 - \frac{1}{5} \cdot \frac{4}{10} \cdot \frac{9}{15}\left(\frac{3}{128}\right)^3 - \ldots\right].$$
We stop at the terms written and find the error, setting in (23):

$$n = 3, \quad m = \frac{1}{5}, \quad x = -\frac{3}{128}.$$

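The factor $[(1-\theta)/(1+\theta x)]^n$ lies between zero and unity, as was shown.
The factor $(1+\theta x)^{m-1}$ will be:

$$\left(1 - \frac{3}{128}\theta\right)^{-4/5} < \left(1 - \frac{3}{128}\right)^{-4/5} = \left(\frac{128}{125}\right)^{4/5} < \left(\frac{6}{5}\right)^{4/5} < \frac{6}{5},$$
since
$$\frac{128}{125} < \frac{6}{5}.$$

We get finally from (23):

$$4|R_3| < 4 \times \frac{4 \cdot 9 \cdot 14}{1 \cdot 2 \cdot 3 \cdot 5^3} \times \frac{1}{5} \times \frac{6}{5} \times \left(\frac{3}{128}\right)^4 < 2 \times 0.2 \times 0.8 \times 0.6 \times 2.8 \times (0.03)^4 < 5 \times 10^{-7}.$$

Calculation of the other terms has to be carried out to six places, since
the total error then will not be more than:

$$4 \times 3 \times 0.5 \times 10^{-6} + 5 \times 10^{-7} = 6.5 \times 10^{-6} < 10^{-5}.$$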


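The calculation can be set out as follows:

$$\begin{array}{rcl}
\frac{1}{5} \cdot \frac{3}{128} & = & 0.0234375 \times 0.2 = 0.004687 \\
\frac{1}{5} \cdot \frac{4}{10}\left(\frac{3}{128}\right)^2 & = & 0.000549 \times 0.08 = 0.000044 \\
\frac{1}{5} \cdot \frac{4}{10} \cdot \frac{9}{15}\left(\frac{3}{128}\right)^3 & = & 0.000013 \times 0.048 = 0.000001 \\
\hline
 & & 0.004732
\end{array}$$
$$1 - 0.004732 = 0.995268, \qquad 0.995268 \times 4 = 3.981072.$$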

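2. Approximate calculation of the length of an ellipse. The following expression
was obtained in [103] for the length l of an ellipse with semi-axes a and
b:

$$l = 4\int_0^{\pi/2}\sqrt{a^2\sin^2 t + b^2\cos^2 t}\,dt = 4a\int_0^{\pi/2}\sqrt{\sin^2 t + \frac{b^2}{a^2}\cos^2 t}\,dt$$

[formula (22)]. Taking into account the eccentricity ε of the ellipse:

$$\varepsilon^2 = \frac{a^2 - b^2}{a^2},$$

we get:

$$l = 4a\int_0^{\pi/2}\sqrt{1 - \varepsilon^2\cos^2 t}\,dt. \qquad (29)$$
This integral cannot be found accurately, but can be evaluated to any
desired degree of accuracy by expanding the integrand into a series of
powers of ε:†

$$\sqrt{1 - \varepsilon^2\cos^2 t} = 1 - \frac{1}{2}\varepsilon^2\cos^2 t + \frac{\frac{1}{2}\left(\frac{1}{2} - 1\right)}{1 \cdot 2}\varepsilon^4\cos^4 t - \frac{\frac{1}{2}\left(\frac{1}{2} - 1\right)\left(\frac{1}{2} - 2\right)}{1 \cdot 2 \cdot 3}\varepsilon^6\cos^6 t + R_3 = 1 - \frac{1}{2}\varepsilon^2\cos^2 t - \frac{1}{8}\varepsilon^4\cos^4 t - \frac{1}{16}\varepsilon^6\cos^6 t + R_3,$$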
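where the error $R_3$, on evaluation by means of (23) with n = 3, satisfies the
inequality:

$$|R_3| = \frac{\frac{1}{2} \cdot \frac{1}{2} \cdot \frac{3}{2} \cdot \frac{5}{2}}{1 \cdot 2 \cdot 3}\,\varepsilon^8\cos^8 t\left(\frac{1-\theta}{1-\theta\varepsilon^2\cos^2 t}\right)^3(1 - \theta\varepsilon^2\cos^2 t)^{-1/2} \leq \frac{5}{32} \cdot \frac{\varepsilon^8\cos^8 t}{\sqrt{1-\varepsilon^2}}, \qquad (30)$$
since

$$0 < \left(\frac{1-\theta}{1-\theta\varepsilon^2\cos^2 t}\right)^3 < 1$$

and
$$(1 - \theta\varepsilon^2\cos^2 t)^{-1/2} \leq (1 - \varepsilon^2\cos^2 t)^{-1/2} \leq (1 - \varepsilon^2)^{-1/2}.$$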


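Substituting this expression in (29) for l, integrating, and recalling (27)
[100], we find:

$$l = 4a\left\{\int_0^{\pi/2}dt - \frac{1}{2}\varepsilon^2\int_0^{\pi/2}\cos^2 t\,dt - \frac{1}{8}\varepsilon^4\int_0^{\pi/2}\cos^4 t\,dt - \frac{1}{16}\varepsilon^6\int_0^{\pi/2}\cos^6 t\,dt + \int_0^{\pi/2}R_3\,dt\right\} = 2\pi a\left[1 - \frac{1}{4}\varepsilon^2 - \frac{3}{64}\varepsilon^4 - \frac{5}{256}\varepsilon^6 + \varrho\right], \qquad (31)$$
where by $(10_1)$ of [95] and inequality (30):
$$|\varrho| = \frac{2}{\pi}\left|\int_0^{\pi/2}R_3\,dt\right| \leq \frac{2}{\pi} \cdot \frac{5}{32} \cdot \frac{\varepsilon^8}{\sqrt{1-\varepsilon^2}}\int_0^{\pi/2}\cos^8 t\,dt = \frac{175}{4096} \cdot \frac{\varepsilon^8}{\sqrt{1-\varepsilon^2}} < \frac{0.05\,\varepsilon^8}{\sqrt{1-\varepsilon^2}}.$$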

† This expansion is in fact possible because ε < 1 for an ellipse, so that the
term $-\varepsilon^2\cos^2 t$, having the role of x in Newton’s binomial formula, is less
than unity in absolute value.

Formula (31) is directly suitable for calculating the length of an ellipse,
especially in the case of small eccentricity. It can also be used as the basis of
a simple geometrical construction for finding approximately the length of
an ellipse, using only ruler and compasses.

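We let $l_1$ and $l_2$ respectively denote the arithmetic and geometric means
of the semi-axes of the ellipse:

$$l_1 = \frac{a+b}{2}, \qquad l_2 = \sqrt{ab},$$

and we compare the length l of the ellipse with the circumferences $2\pi l_1$, $2\pi l_2$
of two circles of radii $l_1$ and $l_2$.
Noting that

$$b = a\sqrt{1-\varepsilon^2}, \qquad \frac{a+b}{2} = \frac{a}{2}\left[1 + \sqrt{1-\varepsilon^2}\right], \qquad \sqrt{ab} = a\sqrt[4]{1-\varepsilon^2},$$

and expanding into series by Newton’s binomial formula, we easily obtain
the following expressions:

$$2\pi l_1 = 2\pi a\left[1 - \frac{1}{4}\varepsilon^2 - \frac{1}{16}\varepsilon^4 - \frac{1}{32}\varepsilon^6 + \varrho_1\right], \qquad (32)$$
$$2\pi l_2 = 2\pi a\left[1 - \frac{1}{4}\varepsilon^2 - \frac{3}{32}\varepsilon^4 - \frac{7}{128}\varepsilon^6 + \varrho_2\right], \qquad (33)$$

where the errors $\varrho_1$ and $\varrho_2$ satisfy the inequalities, on evaluating by (23):

$$|\varrho_1| < \frac{5}{64} \cdot \frac{\varepsilon^8}{\sqrt{1-\varepsilon^2}}, \qquad |\varrho_2| < \frac{77}{512} \cdot \frac{\varepsilon^8}{\sqrt[4]{(1-\varepsilon^2)^3}}.$$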
Hence it is clear that, with small eccentricity, when higher powers of ε
can be neglected compared with $\varepsilon^2$, the length of an ellipse can be taken as the
circumference of a circle of radius equal to either the arithmetic or geometric
mean of the semi-axes. If greater accuracy is desired, we form the expression:

$$\alpha \cdot 2\pi l_1 + \beta \cdot 2\pi l_2, \qquad (34)$$

α and β being chosen so that as many terms as possible of (31) and (34)
coincide. Since the first two terms of each of (31), (32) and (33) coincide,
we must have first of all

$$\alpha + \beta = 1.$$
Equating the coefficients of $\varepsilon^4$ in (31) and (34), we get further:

$$\frac{\alpha}{16} + \frac{3\beta}{32} = \frac{3}{64} \quad \text{or} \quad 4\alpha + 6\beta = 3.$$
Solving the equations obtained for α and β, we find:

$$\alpha = \frac{3}{2}, \qquad \beta = -\frac{1}{2}.$$

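Substituting this in (34), we have:

$$\alpha \cdot 2\pi l_1 + \beta \cdot 2\pi l_2 = 2\pi\left(\frac{3}{2}l_1 - \frac{1}{2}l_2\right) = 2\pi a\left(1 - \frac{1}{4}\varepsilon^2 - \frac{3}{64}\varepsilon^4 - \frac{5}{256}\varepsilon^6 + \frac{3}{2}\varrho_1 - \frac{1}{2}\varrho_2\right), \qquad (35)$$

i.e. the terms in $\varepsilon^6$, as well as $\varepsilon^4$, are seen to coincide, and (31) and (35) only
start to diverge with the term in $\varepsilon^8$. If we take into account the values found
above for $\varrho$, $\varrho_1$ and $\varrho_2$, and if we note that

$$\frac{1}{\sqrt{1-\varepsilon^2}} < \frac{1}{1-\varepsilon^2}, \qquad \frac{1}{\sqrt[4]{(1-\varepsilon^2)^3}} < \frac{1}{1-\varepsilon^2}, \qquad \frac{175}{4096} + \frac{3}{2} \times \frac{5}{64} + \frac{1}{2} \times \frac{77}{512} < 0.4,$$

we can finally say: the error does not exceed $0.4\varepsilon^8/(1-\varepsilon^2)$, if the length of an
ellipse with semi-axes a, b and eccentricity ε is taken as the circumference of
a circle of radius r, where:

$$r = \frac{3}{2} \cdot \frac{a+b}{2} - \frac{1}{2}\sqrt{ab}.$$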
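The rule can be tested against a direct numerical evaluation of the integral (29). The Python sketch below is an illustration under stated assumptions: the radius is taken as $r = \frac{3}{2}l_1 - \frac{1}{2}l_2$, as follows from (35); the integral is evaluated by Simpson's rule, which is our choice and not part of the text; and the bound $0.4\varepsilon^8/(1-\varepsilon^2)$ is read as relative to $2\pi a$, since the errors enter (31) and (35) inside the bracket multiplying $2\pi a$. All function names are ours.

```python
import math

def ellipse_length_simpson(a, b, panels=2000):
    """Length 4a * integral_0^{pi/2} sqrt(1 - eps^2 cos^2 t) dt, with a >= b, by Simpson's rule."""
    eps2 = 1 - (b / a) ** 2

    def f(t):
        return math.sqrt(1 - eps2 * math.cos(t) ** 2)

    h = (math.pi / 2) / panels          # panels must be even
    total = f(0.0) + f(math.pi / 2)
    for i in range(1, panels):
        total += (4 if i % 2 else 2) * f(i * h)
    return 4 * a * total * h / 3

def ellipse_length_circle_rule(a, b):
    """Circumference of the circle of radius r = (3/2) l1 - (1/2) l2."""
    l1 = (a + b) / 2                    # arithmetic mean of the semi-axes
    l2 = math.sqrt(a * b)               # geometric mean of the semi-axes
    return 2 * math.pi * (1.5 * l1 - 0.5 * l2)

def error_bound(a, b):
    """0.4 eps^8 / (1 - eps^2), scaled by 2 pi a (our reading of the estimate)."""
    eps2 = 1 - (b / a) ** 2
    return 2 * math.pi * a * 0.4 * eps2 ** 4 / (1 - eps2)

for a, b in ((5.0, 4.0), (2.0, 1.0)):
    print(a, b, ellipse_length_simpson(a, b), ellipse_length_circle_rule(a, b), error_bound(a, b))
```

For moderate eccentricities the actual discrepancy is far smaller than the bound, which is a rather generous one.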


132. Expansion of log (1 + x).† The general theory can be used for
this expansion, but we shall employ a second method, which is suitable
in many other cases.

We write log (1 + x) in the form of a definite integral. We evidently
have, for x > -1:

$$\int_0^x\frac{dt}{1+t} = \log(1+t)\Big|_0^x = \log(1+x) - \log 1 = \log(1+x),$$
i.e.
$$\log(1+x) = \int_0^x\frac{dt}{1+t}.$$
But we have the identity:

$$\frac{1}{1+t} = 1 - t + t^2 - \ldots + (-1)^{n-1}t^{n-1} + \frac{(-1)^nt^n}{1+t},$$
which is immediately obtained on dividing unity by 1 + t and stopping
at the remainder $(-1)^nt^n$. Thus:

$$\log(1+x) = \int_0^x\frac{dt}{1+t} = \int_0^x\left[1 - t + t^2 - t^3 + \ldots + (-1)^{n-1}t^{n-1}\right]dt + (-1)^n\int_0^x\frac{t^n\,dt}{1+t} = x - \frac{x^2}{2} + \frac{x^3}{3} - \ldots + \frac{(-1)^{n-1}x^n}{n} + R_n(x),$$

where
$$R_n(x) = (-1)^n\int_0^x\frac{t^n\,dt}{1+t}. \qquad (36)$$

† The function log x cannot be expanded into a series of powers of x, since
the function itself and its derivatives have discontinuities at x = 0, where
they tend to infinity.
The series
$$x - \frac{x^2}{2} + \frac{x^3}{3} - \ldots + \frac{(-1)^{n-1}x^n}{n} + \ldots,$$
for which
$$\left|\frac{u_n}{u_{n-1}}\right| = \frac{n-1}{n}|x| \to |x| \quad \text{for } n \to \infty,$$
is in fact divergent for |x| > 1 (Corollary [121]), and we can therefore
only consider the cases:

$$|x| < 1, \qquad x = \pm 1.$$

Here, x = -1 has also to be discarded, since log(1 + x) becomes
infinite for this value of x.

There remain the cases: (1) |x| < 1 and (2) x = 1. In case (1), applying
the mean value theorem [95] to expression (36) for $R_n(x)$ and noting
that $t^n$ does not change sign as t varies from 0 to x, we obtain:

$$R_n(x) = \frac{(-1)^n}{1+\theta x}\int_0^x t^n\,dt = \frac{(-1)^nx^{n+1}}{(n+1)(1+\theta x)} \qquad (0 < \theta < 1), \qquad (37)$$
whence it follows that, since |x| < 1:

$$|R_n(x)| < \frac{1}{n+1} \cdot \frac{1}{|1+\theta x|}.$$

The factor 1/|1 + θx| on the right-hand side of the above inequality
is bounded for all n, since it lies between the limits:

$$1 \quad \text{and} \quad \frac{1}{1+x},$$
these being independent of n; so that for the values of x in question
$$R_n(x) \to 0 \quad \text{as } n \to \infty.$$
We get the same result in case (2), when x = 1. The above formula
(37) shows that, for x = 1:

$$|R_n(1)| = \frac{1}{n+1} \cdot \frac{1}{1+\theta} < \frac{1}{n+1},$$

i.e. again
$$R_n(1) \to 0 \quad \text{as } n \to \infty.$$
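The expansion:
$$\log(1+x) = x - \frac{x^2}{2} + \frac{x^3}{3} - \ldots + \frac{(-1)^{n-1}x^n}{n} + \ldots \qquad (38)$$
is thus valid for all values of x, satisfying the inequalities:
$$-1 < x \leq +1. \qquad (39)$$
In the particular case of x = 1, we have the equation:

$$\log 2 = 1 - \frac{1}{2} + \frac{1}{3} - \ldots + \frac{(-1)^{n-1}}{n} + \ldots,$$

which has already been discussed [123]. Formula (38) is not suitable as
it stands for computing logarithms, since it presupposes that x satisfies
inequality (39); apart from which, the series on the right-hand side
does not converge rapidly enough. It can be transformed into a more
convenient form for computation. We replace x by -x in the equation:
$$\log(1+x) = x - \frac{x^2}{2} + \frac{x^3}{3} - \ldots \qquad (|x| < 1),$$
which gives
$$\log(1-x) = -x - \frac{x^2}{2} - \frac{x^3}{3} - \ldots \qquad (|x| < 1);$$
then we subtract term by term. We get:

$$\log\frac{1+x}{1-x} = 2\left(x + \frac{x^3}{3} + \frac{x^5}{5} + \ldots\right) \qquad (|x| < 1).$$

We put here:
$$\frac{1+x}{1-x} = \frac{a+z}{a}, \qquad x = \frac{z}{2a+z}, \qquad (40)$$
and obtain:

$$\log\frac{a+z}{a} = 2\left[\frac{z}{2a+z} + \frac{1}{3} \cdot \frac{z^3}{(2a+z)^3} + \frac{1}{5} \cdot \frac{z^5}{(2a+z)^5} + \ldots\right],$$
or
$$\log(a+z) = \log a + 2\left[\frac{z}{2a+z} + \frac{1}{3} \cdot \frac{z^3}{(2a+z)^3} + \frac{1}{5} \cdot \frac{z^5}{(2a+z)^5} + \ldots\right]. \qquad (41)$$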

This formula is suitable for all positive values of a and z, since for
these, x = z/(2a + z) lies between zero and unity. The smaller the fraction
z/(2a + z), or what amounts to the same thing, the smaller z compared
with a, the more suitable the formula is for computation.


Formula (41) is extremely useful for computing logarithms. The logarithmic
tables were not in fact worked out by using series, which were still unknown
at the time of Napier and Briggs; but still, (41) provides a handy means of
checking or rapidly computing logarithmic tables. We put z = 1 in (41) and
take the sequence:

$$a = 15, \ 24, \ 80,$$
which gives us:
$$\log 16 - \log 15 = 2\left[\frac{1}{31} + \frac{1}{3 \times 31^3} + \frac{1}{5 \times 31^5} + \ldots\right] = 2P,$$
$$\log 25 - \log 24 = 2\left[\frac{1}{49} + \frac{1}{3 \times 49^3} + \frac{1}{5 \times 49^5} + \ldots\right] = 2Q,$$
$$\log 81 - \log 80 = 2\left[\frac{1}{161} + \frac{1}{3 \times 161^3} + \frac{1}{5 \times 161^5} + \ldots\right] = 2R,$$

where the series denoted by P, Q, R converge very rapidly. These equations
lead us to the equations:

$$4\log 2 - \log 3 - \log 5 = 2P,$$
$$-3\log 2 - \log 3 + 2\log 5 = 2Q,$$
$$-4\log 2 + 4\log 3 - \log 5 = 2R,$$
which can easily be solved for:

$$\log 2, \ \log 3, \ \log 5,$$
giving us:
$$\log 2 = 14P + 10Q + 6R,$$
$$\log 3 = 22P + 16Q + 10R,$$
$$\log 5 = 32P + 24Q + 14R.$$


The logarithms obtained by this means will be natural; we use them to
find the modulus M of the common system of logarithms:

$$M = \frac{1}{\log 10} = \frac{1}{\log 2 + \log 5};$$

knowing this, we can pass from natural to common logarithms by means of
the formula:
$$\log_{10} x = M\log x.$$
Similarly, factorizing as follows:
$$a = 2400 = 100 \times 2^3 \times 3, \qquad a + z = 2401 = 7^4,$$
$$a = 9800 = 100 \times 2 \times 7^2, \qquad a + z = 9801 = 3^4 \times 11^2,$$
$$a = 123200 = 100 \times 2^4 \times 7 \times 11, \qquad a + z = 123201 = 3^6 \times 13^2,$$
$$a = 2600 = 100 \times 2 \times 13, \qquad a + z = 2601 = 3^2 \times 17^2,$$
$$a = 28899 = 3^2 \times 13^2 \times 19, \qquad a + z = 28900 = 100 \times 17^2,$$

we compute:
$$\log 7, \ \log 11, \ \log 13, \ \ldots$$

Having found the logarithms of the prime numbers, the evaluation of 
the logarithms of composite numbers follows simply by additions and multi- 
plications by integers, without the aid of series, since, as we know, composite 
numbers can always be reduced to prime factors. 

133. Expansion of arc tan x. We deal with this as in the case of 
log(1 + x). We have: 


$$d\,\operatorname{arc\,tan} t = \frac{dt}{1+t^2}.$$

We obtain by integrating:

$$\int_0^x \frac{dt}{1+t^2} = \operatorname{arc\,tan} t\,\Big|_0^x = \operatorname{arc\,tan} x - \operatorname{arc\,tan} 0 = \operatorname{arc\,tan} x,$$


where arc tan x takes its principal values, as in the example of [98]. 
We thus have: 


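$$\operatorname{arc\,tan} x = \int_0^x \frac{dt}{1+t^2} = \int_0^x\left[1 - t^2 + t^4 - \dots + (-1)^{n-1}t^{2n-2} + \frac{(-1)^n t^{2n}}{1+t^2}\right]dt =$$

$$= x - \frac{x^3}{3} + \frac{x^5}{5} - \dots + (-1)^{n-1}\frac{x^{2n-1}}{2n-1} + R_n(x),$$

where

$$R_n(x) = (-1)^n\int_0^x \frac{t^{2n}}{1+t^2}\,dt. \tag{42}$$

The series

$$x - \frac{x^3}{3} + \frac{x^5}{5} - \dots + (-1)^{n-1}\frac{x^{2n-1}}{2n-1} + \dots,$$

for which the ratio

$$\left|\frac{u_n}{u_{n-1}}\right| = \frac{2n-3}{2n-1}\,x^2 \to x^2 \quad \text{as } n\to\infty,$$

is certainly divergent for $x^2 > 1$; so that we confine ourselves to the
case of $x^2 \le 1$, i.e.

$$-1 \le x \le 1. \tag{43}$$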
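Taking firstly $x > 0$, we get from (42), by VII [95]:

$$|R_n(x)| = \int_0^x \frac{t^{2n}}{1+t^2}\,dt < \int_0^x t^{2n}\,dt = \frac{x^{2n+1}}{2n+1} \le \frac{1}{2n+1} \to 0 \quad (n\to\infty),$$

since evidently

$$\frac{t^{2n}}{1+t^2} \le t^{2n}.$$
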

If $x < 0$, we obtain on introducing a new variable $\tau$ in place of $t$,

$$t = -\tau:$$

$$R_n(x) = (-1)^{n+1}\int_0^{-x} \frac{\tau^{2n}}{1+\tau^2}\,d\tau.$$

Here the upper limit $(-x)$ is positive, so that we have the same
estimate as before for $|R_n(x)|$, i.e. the expansion

$$\operatorname{arc\,tan} x = x - \frac{x^3}{3} + \frac{x^5}{5} - \dots + (-1)^{n-1}\frac{x^{2n-1}}{2n-1} + \dots \tag{44}$$

is valid for all x with absolute values not exceeding unity.

In particular, we get for x = 1: 


$$\operatorname{arc\,tan} 1 = \frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \dots$$


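This series is unsuitable for computing π, in view of its very slow convergence.
Series (44) converges the more rapidly, the smaller x. Let us put, for
instance:

$$\tan\varphi = \frac{1}{5}, \quad \text{and} \quad \varphi = \operatorname{arc\,tan}\frac{1}{5}.$$

We have:

$$\tan 2\varphi = \frac{\frac{2}{5}}{1 - \frac{1}{25}} = \frac{5}{12}, \qquad \tan 4\varphi = \frac{\frac{5}{6}}{1 - \frac{25}{144}} = \frac{120}{119}.$$

Since tan 4φ only differs slightly from unity, the angle 4φ is only slightly
different from π/4. Let this small difference be:

$$\psi = 4\varphi - \frac{\pi}{4}.$$

Hence we deduce that:

$$\tan\psi = \frac{\tan 4\varphi - \tan\frac{\pi}{4}}{1 + \tan 4\varphi\cdot\tan\frac{\pi}{4}} = \frac{\frac{120}{119} - 1}{1 + \frac{120}{119}} = \frac{1}{239},$$

which gives:

$$\frac{\pi}{4} = 4\varphi - \psi = 4\operatorname{arc\,tan}\frac{1}{5} - \operatorname{arc\,tan}\frac{1}{239} =$$

$$= 4\left[\frac{1}{5} - \frac{1}{3}\,\frac{1}{5^3} + \frac{1}{5}\,\frac{1}{5^5} - \frac{1}{7}\,\frac{1}{5^7} + \dots\right] - \left[\frac{1}{239} - \dots\right].$$

Both series in brackets are alternating [123], so that if we confine ourselves
to the terms written in each, our error will not exceed

$$\frac{4}{9\times 5^9} + \frac{1}{3\times 239^3} < 0.5\times 10^{-6}.$$

If π is required to an accuracy of $10^{-5}$, we must compute the individual
terms to seven places, since then the error for π/4 will not exceed:

$$4\times 4\times 0.5\times 10^{-7} + 0.5\times 10^{-7} + 0.5\times 10^{-6} < 2\times 10^{-6},$$

whilst the error for π will be less than $8\times 10^{-6}$.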
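The computation can be set out as follows:

$$\begin{aligned}
\frac{1}{5} &= 0.2000000, & \frac{1}{3\times 5^3} &= 0.0026667,\\
\frac{1}{5\times 5^5} &= 0.0000640, & \frac{1}{7\times 5^7} &= 0.0000018,\\
\text{sum} &= 0.2000640, & \text{sum} &= 0.0026685,
\end{aligned}$$

$$\begin{aligned}
0.2000640 - 0.0026685 &= 0.1973955,\\
4\times 0.1973955 &= 0.7895820,\\
\frac{1}{239} &= 0.0041841,\\
0.7895820 - 0.0041841 &= 0.7853979,\\
\pi \approx 4\times 0.7853979 &= 3.1415916.
\end{aligned}$$

The value of π to eight places is 3.14159265.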
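
The computation can also be reproduced mechanically. The sketch below is illustrative only (the function name `arctan_series` is an assumption): it sums four terms of (44) for arc tan 1/5 and one term for arc tan 1/239, as in the text, and compares the result with five terms of the slowly convergent series for arc tan 1.

```python
import math

def arctan_series(x, terms):
    """Partial sum of (44): x - x^3/3 + x^5/5 - ..., with the given number of terms."""
    return sum((-1) ** k * x ** (2 * k + 1) / (2 * k + 1) for k in range(terms))

# pi/4 = 4 arc tan(1/5) - arc tan(1/239), truncated as in the text.
pi_machin = 4 * (4 * arctan_series(1 / 5, 4) - arctan_series(1 / 239, 1))

# Five terms of the series for arc tan 1 = pi/4, for comparison.
pi_leibniz = 4 * arctan_series(1.0, 5)

print(pi_machin, abs(pi_machin - math.pi))
print(pi_leibniz, abs(pi_leibniz - math.pi))
```

With the same total number of terms, the first value lies within the error bound derived above, while the second is still wrong in the first decimal place.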
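We can obtain the expansion, with $|x| < 1$:

$$\operatorname{arc\,sin} x = x + \frac{1}{2}\,\frac{x^3}{3} + \frac{1\times 3}{2\times 4}\,\frac{x^5}{5} + \dots + \frac{1\times 3\times 5\dots(2n-1)}{2\times 4\times 6\dots 2n}\,\frac{x^{2n+1}}{2n+1} + \dots \tag{45}$$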


134. Approximate formulae. Maclaurin's series, when convergent,
enables a function f(x) to be computed approximately by replacing
it by a finite number of terms of the expansion:

$$f(0) + \frac{x f'(0)}{1!} + \frac{x^2 f''(0)}{2!} + \dots$$

The smaller x, the smaller the number of terms that need be taken 
in this expansion in order to compute f(x) with the desired accuracy. 
If x is very small, we can confine ourselves to the first two terms and 
neglect all the rest. We thus get a very simple approximate formula 
for f(x), which, for small x, can entirely replace the often very compli- 
cated accurate expression for f(x). 

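We note these approximate formulae for the most important functions:

$$\begin{aligned}
\sqrt[n]{1+x} &\approx 1 + \frac{x}{n}, & \sin x &\approx x,\\
\frac{1}{\sqrt[n]{1+x}} &\approx 1 - \frac{x}{n}, & \cos x &\approx 1 - \frac{x^2}{2},\\
(1+x)^n &\approx 1 + nx, & \tan x &\approx x,\\
a^x &\approx 1 + x\log a, & \log(1+x) &\approx x.
\end{aligned}$$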


By using these approximate formulae for (positive or negative) 
x near zero, complicated expressions can be very much simplified. 

















Examples. 
, m n myn 
turc (+23) m n —m 
1. = ~ (147) (1 a) ~ 
n— m n—m \n n n 
1— ¿3% (.- 7 2 
n n 
m n-—m 
~1l+ 7 et 1 w#=Il+en. 








2. $$\log\sqrt{\frac{1-x}{1+x}} = \frac{1}{2}\log(1-x) - \frac{1}{2}\log(1+x) \approx -\frac{1}{2}x - \frac{1}{2}x = -x.$$


3. To find the increase in volume of a substance on heating (cubical expansion),
when the coefficient of linear expansion α is known. If one of the
linear dimensions of the body is $l_0$ at 0°, on heating to t° it will be:

$$l = l_0(1 + \alpha t).$$

The coefficient of expansion α is very small for the majority of substances
($< 10^{-5}$). Since the ratio of the volumes is that of the cubes of the linear
dimensions, we can write:

$$\frac{v}{v_0} = (1+\alpha t)^3, \qquad v = v_0(1+\alpha t)^3 \approx v_0(1 + 3\alpha t),$$

i.e. the number 3α gives us the coefficient of cubical expansion. We find a similar
relationship for the density ρ, which is inversely proportional to the volume:

$$\frac{\rho}{\rho_0} = \frac{1}{(1+\alpha t)^3}, \qquad \rho = \rho_0(1+\alpha t)^{-3} \approx \rho_0(1 - 3\alpha t).$$

All these approximate formulae are evidently only suitable for sufficiently 
small x; they prove quite inaccurate if this is not the case, and further terms 
of the expansion have to be taken into account. 
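
How quickly the approximate formulae lose accuracy as x grows can be seen numerically. The following is a rough sketch; the function names, the exponent n = 3 and the base a = 2 are arbitrary illustrative choices, not values taken from the text.

```python
import math

def approximations(x, n=3, a=2.0):
    """Pairs (exact value, approximate value) for the formulae of this section."""
    return {
        "(1+x)^(1/n)": ((1 + x) ** (1 / n), 1 + x / n),
        "(1+x)^(-1/n)": ((1 + x) ** (-1 / n), 1 - x / n),
        "(1+x)^n": ((1 + x) ** n, 1 + n * x),
        "a^x": (a ** x, 1 + x * math.log(a)),
        "log(1+x)": (math.log1p(x), x),
        "sin x": (math.sin(x), x),
        "cos x": (math.cos(x), 1 - x * x / 2),
        "tan x": (math.tan(x), x),
    }

def worst_relative_error(x):
    """Largest relative error among the approximate formulae at the given x."""
    return max(abs(approx - exact) / abs(exact)
               for exact, approx in approximations(x).values())

for x in (0.001, 0.01, 0.1, 0.5):
    print(x, worst_relative_error(x))
```

The worst relative error grows roughly in proportion to x, which is why further terms of the expansion are needed once x is no longer small.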


135. Maxima, minima and points of inflexion. Taylor’s formula 
allows an important addition to be made to the rules for finding the 
maxima and minima of a function, laid down in [58]. We shall assume 
in future that f(x) has continuous derivatives up to order n at, and in
the vicinity of, $x = x_0$.

If the first (n - 1) derivatives of f(x) vanish at $x = x_0$:

$$f'(x_0) = f''(x_0) = \dots = f^{(n-1)}(x_0) = 0,$$

whilst the n-th derivative $f^{(n)}(x_0)$ differs from zero, the value $x_0$ corresponds
to a peak of the curve if n, i.e. the order of the first non-vanishing
derivative, is even, in which case we have:

a maximum, if $f^{(n)}(x_0) < 0$,
a minimum, if $f^{(n)}(x_0) > 0$;

whereas if n is odd, $x_0$ corresponds to a point of inflexion, and not to
a peak.
We prove this by considering the differences:

$$f(x_0 + h) - f(x_0) \quad \text{and} \quad f(x_0 - h) - f(x_0),$$

where h is a suitably small positive number. By the definition of
maximum and minimum [58], there will be a maximum at $x_0$ if both
these differences are less than zero, and a minimum if both are greater
than zero. If these differences have different signs for any arbitrarily
small positive h, there will be neither a maximum nor minimum at $x_0$.
The differences are evaluated by substituting $x_0$ in place of a, and
±h in place of h in Taylor's formula:†

$$f(x_0 + h) = f(x_0) + \frac{h}{1!}f'(x_0) + \dots + \frac{h^{n-1}}{(n-1)!}f^{(n-1)}(x_0) + \frac{h^n}{n!}f^{(n)}(x_0 + \theta h),$$

$$f(x_0 - h) = f(x_0) - \frac{h}{1!}f'(x_0) + \dots + \frac{(-1)^{n-1}h^{n-1}}{(n-1)!}f^{(n-1)}(x_0) + \frac{(-1)^n h^n}{n!}f^{(n)}(x_0 - \theta_1 h)$$

$$(0 < \theta < 1 \ \text{and} \ 0 < \theta_1 < 1).$$

† We take Lagrange's form of the remainder term; the number θ, lying
between zero and unity, is not the same for (+h) as for (-h), which is why
we write $\theta_1$ in the second formula.

By hypothesis:

$$f'(x_0) = f''(x_0) = \dots = f^{(n-1)}(x_0) = 0, \qquad f^{(n)}(x_0) \ne 0;$$

and therefore:

$$f(x_0 + h) - f(x_0) = \frac{h^n}{n!}f^{(n)}(x_0 + \theta h),$$

$$f(x_0 - h) - f(x_0) = \frac{(-1)^n h^n}{n!}f^{(n)}(x_0 - \theta_1 h).$$

For sufficiently small positive h, the factors:

$$f^{(n)}(x_0 + \theta h) \quad \text{and} \quad f^{(n)}(x_0 - \theta_1 h)$$

have, by the assumed continuity of $f^{(n)}(x)$, identical signs, the same,
in fact, as that of the non-zero number $f^{(n)}(x_0)$.

We have seen that there can be a peak at $x_0$ when, and only when,
both the differences:

$$f(x_0 \pm h) - f(x_0)$$

have the same sign; but this can only be the case when n is even, by
what has just been said; if n is odd, the factors $h^n$ and $(-1)^n h^n$ have
different signs, and the differences in question therefore also have
different signs.

We now take n even; the differences

$$f(x_0 \pm h) - f(x_0)$$

both have the sign of $f^{(n)}(x_0)$. If $f^{(n)}(x_0) < 0$,

$$f(x_0 \pm h) - f(x_0) < 0,$$

and we have a maximum; whilst if $f^{(n)}(x_0) > 0$,

$$f(x_0 \pm h) - f(x_0) > 0,$$

and we get a minimum.

If n is odd, Taylor's formula gives us for the second derivative f''(x)
for every case of $n \ge 3$:

$$f''(x_0 + h) = \frac{h^{n-2}}{(n-2)!}f^{(n)}(x_0 + \theta_2 h);$$

$$f''(x_0 - h) = \frac{(-1)^{n-2}h^{n-2}}{(n-2)!}f^{(n)}(x_0 - \theta_3 h),$$

whence we see, by a similar argument to the above, and noting that
(n - 2) is odd, that f''(x) changes sign at its vanishing point $x = x_0$,
i.e. $x_0$ corresponds to a point of inflexion [71], as it was required to
prove.
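
The rule can be turned into a small procedure. The sketch below is restricted, as an assumption made for simplicity, to polynomials with integer coefficients evaluated at an integer point, so that the test "derivative equals zero" is exact; the function names are hypothetical.

```python
def poly_derivative(coeffs):
    """Coefficients (constant term first) of the derivative of a polynomial."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def poly_value(coeffs, x):
    """Value of the polynomial at x, by Horner's scheme."""
    result = 0
    for c in reversed(coeffs):
        result = result * x + c
    return result

def classify(coeffs, x0):
    """Classify x0 by the order n of the first non-vanishing derivative at x0."""
    d = poly_derivative(coeffs)
    n = 1
    while d:
        value = poly_value(d, x0)
        if value != 0:
            break
        d = poly_derivative(d)
        n += 1
    else:
        return "undetermined"  # every derivative vanishes (constant polynomial)
    if n == 1:
        return "not a stationary point"
    if n % 2 == 0:
        return "maximum" if value < 0 else "minimum"
    return "point of inflexion"

print(classify([0, 0, 0, 0, 1], 0))  # x^4 at 0
print(classify([0, 0, 0, 1], 0))     # x^3 at 0
print(classify([0, 0, -1], 0))       # -x^2 at 0
```

For functions that are not polynomials the same logic applies, but the vanishing of a derivative would then have to be established analytically rather than by a floating-point comparison.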


136. Evaluation of indeterminate forms. We take the ratio of two 
functions 
$$\frac{\varphi(x)}{\psi(x)},$$

which vanish at x = a. We evaluate the indeterminate form

$$\left.\frac{\varphi(x)}{\psi(x)}\right|_{x=a},$$

where φ(a) = ψ(a) = 0, by expanding numerator and denominator
in accordance with Taylor's formula:

$$\varphi(x) = (x-a)\,\varphi'(a) + \frac{(x-a)^2}{2!}\varphi''(a) + \dots + \frac{(x-a)^n}{n!}\varphi^{(n)}(a) + \frac{(x-a)^{n+1}}{(n+1)!}\varphi^{(n+1)}(\xi_1),$$

$$\psi(x) = (x-a)\,\psi'(a) + \frac{(x-a)^2}{2!}\psi''(a) + \dots + \frac{(x-a)^n}{n!}\psi^{(n)}(a) + \frac{(x-a)^{n+1}}{(n+1)!}\psi^{(n+1)}(\xi_2).$$

We then cancel out some power of (x - a) in the ratio in question
and afterwards put x = a.





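Examples.

1. $$\lim_{x\to 0}\frac{1 - \cos 2x}{e^{3x} - 1 - 3x} = \lim_{x\to 0}\frac{1 - \left(1 - \frac{4x^2}{2!} + \frac{16x^4}{4!} - \dots\right)}{\left(1 + 3x + \frac{9x^2}{2!} + \frac{27x^3}{3!} + \dots\right) - 1 - 3x} = \lim_{x\to 0}\frac{2 - \frac{16x^2}{24} + \dots}{\frac{9}{2} + \frac{27x}{6} + \dots} = \frac{4}{9}.$$

The same method is useful in evaluating other types of indeterminate form.

We take one example:

2. $$\lim_{x\to\infty}\left(\sqrt[3]{x^3 - 5x^2 + 1} - x\right).$$

Here we have the indeterminate form (∞ - ∞). We have:

$$\sqrt[3]{x^3 - 5x^2 + 1} - x = x\left[\left(1 - \frac{5}{x} + \frac{1}{x^3}\right)^{1/3} - 1\right].$$

For x sufficiently large in absolute value, the difference $\left(\frac{5}{x} - \frac{1}{x^3}\right)$
is near zero, and we can apply Newton's binomial formula with m = 1/3
and x replaced by $-(5/x - 1/x^3)$:

$$\left[1 - \left(\frac{5}{x} - \frac{1}{x^3}\right)\right]^{1/3} = 1 - \frac{1}{3}\left(\frac{5}{x} - \frac{1}{x^3}\right) + \frac{\frac{1}{3}\left(\frac{1}{3} - 1\right)}{2!}\left(\frac{5}{x} - \frac{1}{x^3}\right)^2 + \dots$$

Substituting this in the brackets above and cancelling the ones, we get:

$$\sqrt[3]{x^3 - 5x^2 + 1} - x = x\left[-\frac{1}{3}\left(\frac{5}{x} - \frac{1}{x^3}\right) + \frac{\frac{1}{3}\left(\frac{1}{3} - 1\right)}{2!}\left(\frac{5}{x} - \frac{1}{x^3}\right)^2 + \dots\right] = -\frac{5}{3} + \frac{1}{3x^2} + \dots,$$

where all the terms not written contain only negative powers of x, i.e. vanish
in the limit as x → ∞; hence:

$$\lim_{x\to\infty}\left(\sqrt[3]{x^3 - 5x^2 + 1} - x\right) = -\frac{5}{3}.$$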


It is easy to justify the passages to the limit in infinite series, utilized in 
the present section, and we shall not dwell on this. 
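
The two limits of this section can be checked by direct evaluation near the limiting point. This is only a numerical sanity check, not a substitute for the expansions; the function names are illustrative, and the first expression is rewritten (using 1 - cos 2x = 2 sin² x and `math.expm1`) purely to reduce rounding error.

```python
import math

def ratio(x):
    """(1 - cos 2x) / (e^{3x} - 1 - 3x), arranged to limit cancellation error."""
    return 2 * math.sin(x) ** 2 / (math.expm1(3 * x) - 3 * x)

def root_difference(x):
    """cbrt(x^3 - 5x^2 + 1) - x for large positive x."""
    return (x ** 3 - 5 * x ** 2 + 1) ** (1 / 3) - x

for x in (1e-1, 1e-2, 1e-3):
    print(x, ratio(x))            # values approach 4/9
for x in (1e1, 1e2, 1e4):
    print(x, root_difference(x))  # values approach -5/3
```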


§ 14. Further remarks on the theory of series 


137. Properties of absolutely convergent series. Absolutely convergent series 
were defined in [124]. We now establish their most important properties. 

The sum of an absolutely convergent series is independent of the order of
the terms.

We first prove this statement for series with non-negative terms, which,
as we know [120], can only be either convergent (and hence absolutely convergent)
or strictly divergent.

Let the convergent series

$$u_1 + u_2 + u_3 + \dots + u_n + \dots \tag{1}$$

be given, with positive (non-negative) terms.
Let $s_n$ denote the sum of its first n terms, and s its sum. We evidently have:

$$s_n \le s.$$

On rearranging the terms of (1) in any manner, we get a second distribution
of terms, corresponding to the series

$$v_1 + v_2 + v_3 + \dots + v_n + \dots; \tag{2}$$

each term of (1) has a definite place in (2), and conversely. Let $\sigma_n$ denote
the sum of the first n terms of series (2). For any n, a greater number m
can be found, such that all the terms appearing in the sum $\sigma_n$ come from
$s_m$ and therefore

$$\sigma_n \le s_m \le s.$$

We have thus proved the existence of a constant number s, independent
of n, such that, for all n:

$$\sigma_n \le s.$$

It follows [120] that series (2) is convergent. Let its sum be σ. Evidently,

$$\sigma = \lim_{n\to\infty}\sigma_n \le s.$$

Interchanging series (1) and (2) in the above argument, we can prove in
the same way that

$$s \le \sigma,$$

and it follows from the inequalities $\sigma \le s$, $s \le \sigma$, that

$$s = \sigma.$$
We now turn to series with terms of any sign. Since series (1) is absolutely
convergent by hypothesis, the series with positive terms

$$|u_1| + |u_2| + \dots + |u_n| + \dots = \sum_{n=1}^{\infty}|u_n| \tag{3}$$

is convergent, and its sum s' is independent of the order of the terms by what
has been proved. On the other hand, both the series

$$\sum_{n=1}^{\infty}\frac{1}{2}\left(|u_n| + u_n\right), \qquad \sum_{n=1}^{\infty}\frac{1}{2}\left(|u_n| - u_n\right)$$

(cf. [124]) also have positive terms and are also convergent, since the general
term of each does not exceed $|u_n|$, i.e. the general term of the convergent
series (3).

Their sums are independent of the order of the terms, by what has been
proved; and the same can be said of their difference, which is identical with
the sum of series (1). This was what we required to prove.

COROLLARY. The terms of an absolutely convergent series can be grouped
together in any manner and added in each group, since the grouping is carried
out by changing the order of the terms, which does not alter the sum of the
series.

Note. If any sequence of terms is isolated from an absolutely convergent
series, the series thus obtained is also absolutely convergent, since
the operation corresponds to isolating a sequence of terms from series (3)
with positive terms, which evidently does not affect the convergence of (3)
and in fact reduces its sum. In particular, the series obtained by isolating the
positive and negative terms of an absolutely convergent series are themselves convergent.
Let s' denote the sum of the series consisting of positive terms, and (-s'')
the sum of the series consisting of negative terms. On indefinite increase
of n, the sum $s_n$ of the first n terms of the original series will contain as many
terms as desired of the two series in question, and we obviously get in the
limit:

$$s = \lim s_n = s' - s''.$$


It can easily be shown that, when a series is not absolutely convergent, 
the series made up of its positive terms, and that made up of its negative 
terms, are strictly divergent. For instance, the series [124]: 


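$$1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \dots$$

is not absolutely convergent; and the series

$$1 + \frac{1}{3} + \frac{1}{5} + \dots \quad \text{and} \quad -\frac{1}{2} - \frac{1}{4} - \frac{1}{6} - \dots$$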


are divergent. The sum of the first n terms of the first series tends to (+ co), 
and that of the second series to (— co), on indefinite increase of n. Riemann 
used the above-mentioned fact to show that, by suitably changing the order 
of the terms of a non-absolutely convergent series, its sum can be made equal 
to any desired number. It is thus seen that the concept of an absolutely con- 
vergent series is identical with the concept of a series, the sum of which does 
not depend on the order of the terms. 
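
Riemann's observation can be illustrated on the series 1 - 1/2 + 1/3 - 1/4 + ... itself. The sketch below uses one simple greedy rule (an assumption for illustration, not necessarily Riemann's own construction): take the next unused positive term while the running sum does not exceed the target, otherwise take the next unused negative term.

```python
import math

def rearranged_partial_sum(target, n_terms):
    """Partial sum of a rearrangement of 1 - 1/2 + 1/3 - ... steered towards target."""
    next_odd, next_even = 1, 2   # denominators of the next positive / negative term
    total = 0.0
    for _ in range(n_terms):
        if total <= target:
            total += 1.0 / next_odd
            next_odd += 2
        else:
            total -= 1.0 / next_even
            next_even += 2
    return total

# Natural order: positive and negative terms alternate, sum tends to log 2.
natural = sum((-1) ** (k + 1) / k for k in range(1, 200001))

print(natural, math.log(2))
print(rearranged_partial_sum(2.0, 200000))
print(rearranged_partial_sum(-1.0, 200000))
```

Every term of the original series is eventually used exactly once, since both the positive and the negative parts diverge; only the order has changed, yet the partial sums settle near whichever target was chosen.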

We note further that, if we change the places of a finite number of terms 
of a convergent (not necessarily absolutely convergent) series, the sum s, 
of its first n terms remains the same for all sufficiently large n, i.e. the con- 
vergence of the series is unaffected, and its sum remains as before. The argu- 
ments and results above refer to the case when an infinite number of terms 
is rearranged. 


138. Multiplication of absolutely convergent series. The rule for multiplying
finite sums can be used for cross-multiplying two absolutely convergent infinite
series: their product is equal to the sum of the series which we get by multiplying
each term of one series by each term of the other and adding the products obtained.
The order of the terms is of no importance, since the series thus obtained is
also absolutely convergent.

Let the absolutely convergent series be given:

$$s = u_1 + u_2 + \dots + u_n + \dots, \qquad \sigma = v_1 + v_2 + \dots + v_n + \dots \tag{4}$$

We first take the special case of both series having positive terms, and
also of multiplication being carried out in the following way:

$$u_1v_1 + u_1v_2 + u_2v_1 + u_1v_3 + u_2v_2 + u_3v_1 + \dots + u_1v_n + u_2v_{n-1} + \dots + u_nv_1 + \dots \tag{5}$$

First of all, we show that series (5), all the terms of which are also positive,
is convergent; then we show that its sum S is equal to sσ.

Let $S_n$ denote the sum of the first n terms of (5). We can always select
a large m, such that all the terms composing $S_n$ come from the product of
the sums:

$$s_m = u_1 + u_2 + \dots + u_m, \qquad \sigma_m = v_1 + v_2 + \dots + v_m,$$

i.e. so that $S_n \le s_m\sigma_m$, i.e.

$$S_n \le s\sigma, \tag{6}$$

since $s_m \le s$, $\sigma_m \le \sigma$; hence follows the convergence of series (5) [120].

Letting S denote the sum of series (5), we evidently have, from inequality
(6):

$$S = \lim_{n\to\infty} S_n \le s\sigma.$$

We now consider the product $s_n\sigma_n$. Given n, a large m can evidently be
found, such that all the terms comprising the product of the sums $s_n$ and $\sigma_n$
come from the sum $S_m$; we then get:

$$s_n\sigma_n \le S_m \le S,$$

and therefore in the limit, as n → ∞,

$$\lim s_n\sigma_n = s\sigma \le S. \tag{7}$$

This inequality, together with (6), gives S = sσ, which it was required
to prove.

Now let the series (4) be absolutely convergent, but with terms of arbitrary
sign. We now have the convergence of the series with positive terms:

$$|u_1| + |u_2| + \dots + |u_n| + \dots \quad \text{and} \quad |v_1| + |v_2| + \dots + |v_n| + \dots$$

and hence, by what has just been proved, the convergence of the series

$$|u_1||v_1| + |u_1||v_2| + |u_2||v_1| + \dots + |u_1||v_n| + |u_2||v_{n-1}| + \dots + |u_n||v_1| + \dots$$

It is clear from this that series (5), composed in accordance with the above
rule, is absolutely convergent in this case. We now let

$$a'_1, a'_2, \dots, a'_n, \dots; \qquad a''_1, a''_2, \dots, a''_n, \dots$$

$$b'_1, b'_2, \dots, b'_n, \dots; \qquad b''_1, b''_2, \dots, b''_n, \dots$$

denote respectively the positive terms of the series (4), and the absolute
values of the negative terms. We know (cf. Note of [137]) that the series
composed of these terms are convergent; we put

$$s' = \sum_{n=1}^{\infty}a'_n, \quad \sigma' = \sum_{n=1}^{\infty}b'_n, \quad s'' = \sum_{n=1}^{\infty}a''_n, \quad \sigma'' = \sum_{n=1}^{\infty}b''_n. \tag{8}$$

We have [137]:

$$s = s' - s'', \qquad \sigma = \sigma' - \sigma''.$$

As has been shown, pairs of the series (8) with positive terms can be
multiplied term by term; the sum of the products of the series

$$s'\sigma', \quad s''\sigma'', \quad -s'\sigma'', \quad -s''\sigma'$$

contains those, and only those, terms which appear in series (5), and hence
we have:

$$S = s'\sigma' + s''\sigma'' - s'\sigma'' - s''\sigma' = (s' - s'')(\sigma' - \sigma'') = s\sigma,$$

which it was required to prove.
Example. The series

$$1 + q + q^2 + \dots + q^{n-1} + \dots = \frac{1}{1-q}$$

is absolutely convergent for $|q| < 1$, and therefore:

$$\frac{1}{(1-q)^2} = (1 + q + q^2 + \dots)(1 + q + q^2 + \dots) = 1 + 2q + 3q^2 + \dots + nq^{n-1} + \dots$$
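
The example can be verified on truncated series. The sketch below groups the products along the diagonals of rule (5); the function name `cauchy_product`, the value q = 0.5 and the truncation at 60 terms are illustrative assumptions.

```python
def cauchy_product(u, v):
    """Diagonal sums u_1 v_k + u_2 v_(k-1) + ... + u_k v_1 of two truncated series."""
    n = min(len(u), len(v))
    return [sum(u[i] * v[k - i] for i in range(k + 1)) for k in range(n)]

q = 0.5
geometric = [q ** k for k in range(60)]          # 1, q, q^2, ...
square = cauchy_product(geometric, geometric)    # expected: 1, 2q, 3q^2, ...

print(square[:4])
print(sum(square), 1 / (1 - q) ** 2)
```

Each diagonal sum collects the k + 1 equal products $q^i q^{k-i} = q^k$, which is where the coefficients 1, 2, 3, ... come from.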


139. Kummer's test. Cauchy's and d'Alembert's tests for the convergence
and divergence of series [121], whilst of great practical importance, are
extremely specialized, and cannot in fact be used in a number of fairly simple
cases. The test given below possesses much greater generality.

Kummer's test. The series with positive terms:

$$u_1 + u_2 + \dots + u_n + \dots \tag{9}$$

is convergent, provided a sequence of positive numbers $a_1, a_2, \dots, a_n, \dots$ can
be found, such that we always have, after some initial value of n:

$$a_n\frac{u_n}{u_{n+1}} - a_{n+1} \ge \alpha > 0, \tag{10}$$

where α is a positive number, independent of n; series (9) is divergent, provided
we have, for the same values of n:

$$a_n\frac{u_n}{u_{n+1}} - a_{n+1} \le 0, \tag{11}$$

where furthermore, the series $\sum_{n=1}^{\infty} 1/a_n$ is divergent.

We can assume without loss of generality that the conditions of the theorem
are satisfied, starting with n = 1. First, let condition (10) be satisfied. We
deduce from this, putting n = 1, 2, 3, ...:

$$a_1u_1 - a_2u_2 \ge \alpha u_2, \quad a_2u_2 - a_3u_3 \ge \alpha u_3, \quad \dots, \quad a_{n-1}u_{n-1} - a_nu_n \ge \alpha u_n,$$

whence, adding term by term and cancelling as necessary, we find:

$$\alpha(u_2 + \dots + u_n) \le a_1u_1 - a_nu_n < a_1u_1.$$

Thus, the sum of the first n terms of series (9) excluding $u_1$ is less than a constant
$a_1u_1/\alpha$, independent of n; and therefore series (9) with positive terms is
convergent [120].

Now let condition (11) be satisfied. It gives us:

$$\frac{u_{n+1}}{u_n} \ge \frac{1/a_{n+1}}{1/a_n},$$

i.e. $u_{n+1}/u_n$ is not less than the corresponding ratio of the terms of the divergent
series:

$$\sum_{n=1}^{\infty}\frac{1}{a_n}. \tag{12}$$

The divergence of series (9) now follows from the lemma below about
series with positive terms:

Supplementary note on d'Alembert's test. If, starting from a certain value
of n, $u_{n+1}/u_n$ does not exceed the corresponding ratio $v_{n+1}/v_n$ of the terms of a
convergent series $\sum_{n=1}^{\infty} v_n$, the series $\sum_{n=1}^{\infty} u_n$ is also convergent. Whereas, if $u_{n+1}/u_n$
is not less than the corresponding ratio of the terms of a divergent series $\sum_{n=1}^{\infty} v_n$,
the series $\sum_{n=1}^{\infty} u_n$ is also divergent.

Suppose we have, in the first place:

$$\frac{u_{n+1}}{u_n} \le \frac{v_{n+1}}{v_n},$$

where

$$\sum_{n=1}^{\infty} v_n \tag{13}$$

is convergent. We have consecutively:

$$\frac{u_n}{u_{n-1}} \le \frac{v_n}{v_{n-1}}, \quad \frac{u_{n-1}}{u_{n-2}} \le \frac{v_{n-1}}{v_{n-2}}, \quad \dots, \quad \frac{u_2}{u_1} \le \frac{v_2}{v_1},$$

so that we find on cross-multiplying:

$$\frac{u_n}{u_1} \le \frac{v_n}{v_1} \quad \text{or} \quad u_n \le \frac{u_1}{v_1}v_n.$$

The convergence of $\sum u_n$ follows from the last inequality and the remark
in [120] (with $k = u_1/v_1$). The divergence of the series can be proved similarly
in the case when $u_{n+1}/u_n \ge v_{n+1}/v_n$ and $\sum v_n$ is divergent.
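
Kummer's expression is easy to evaluate for concrete series. The following sketch uses exact rational arithmetic; the auxiliary sequence $a_n = n$ and the two sample series $\sum 1/n^2$ and $\sum 1/n$ are illustrative choices, not taken from the text.

```python
from fractions import Fraction

def kummer_expression(u, a, n):
    """a_n * u_n / u_(n+1) - a_(n+1), for sequences given as functions of n."""
    return a(n) * u(n) / u(n + 1) - a(n + 1)

a = lambda n: Fraction(n)               # auxiliary sequence a_n = n; sum of 1/a_n diverges
u_conv = lambda n: Fraction(1, n * n)   # u_n = 1/n^2
u_div = lambda n: Fraction(1, n)        # u_n = 1/n

conv_values = [kummer_expression(u_conv, a, n) for n in range(1, 50)]
div_values = [kummer_expression(u_div, a, n) for n in range(1, 50)]

print(conv_values[:3])  # equals (n + 1)/n, so condition (10) holds with alpha = 1
print(div_values[:3])   # identically zero, so condition (11) holds
```

For $u_n = 1/n^2$ the expression equals $(n+1)/n \ge 1$, which gives convergence; for $u_n = 1/n$ it is identically zero, and since $\sum 1/a_n$ diverges the test gives divergence.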


140. Gauss's test. Gauss's test has extremely important applications.†
If in the series (9) with positive terms:

$$u_1 + u_2 + \dots + u_n + \dots$$

the ratio $u_n/u_{n+1}$ can be written in the form:

$$\frac{u_n}{u_{n+1}} = 1 + \frac{\mu}{n} + \frac{\omega_n}{n^p}, \quad \text{where } p > 1 \text{ and } |\omega_n| < A, \tag{14}$$

A being independent of n, i.e. $\omega_n$ being bounded, series (9) is convergent for
μ > 1, and divergent for μ ≤ 1.

We remark that d'Alembert's test is inapplicable in all the cases covered
by this test [121]. Formula (14) is itself got by expanding $u_n/u_{n+1}$ in powers
of 1/n, i.e. by sorting out the terms of different orders of smallness with
respect to 1/n, assuming, of course, that this is possible.

We turn to the proof, and investigate separately the cases: (1) μ ≠ 1,
and (2) μ = 1. In case (1), we put $a_n = n$ in Kummer's test, noting that
$a_n > 0$ and that $\sum 1/n$ diverges [119]. We evidently have here:

$$\lim\left[a_n\frac{u_n}{u_{n+1}} - a_{n+1}\right] = \lim\left[n\left(1 + \frac{\mu}{n} + \frac{\omega_n}{n^p}\right) - n - 1\right] = \mu - 1.$$

If μ > 1, we shall have, starting from a certain value of n:

$$a_n\frac{u_n}{u_{n+1}} - a_{n+1} \ge \alpha > 0,$$

where α is any positive number less than μ - 1; and series (9) will be convergent.
Whereas if μ < 1, we shall have, starting from a certain value of n:

$$a_n\frac{u_n}{u_{n+1}} - a_{n+1} < 0,$$

and series (9) will thus be divergent [139].

We have in case (2):

$$\frac{u_n}{u_{n+1}} = 1 + \frac{1}{n} + \frac{\omega_n}{n^p}.$$

We put $a_n = n\log n$ in Kummer's test, and form the series

$$\sum\frac{1}{a_n} = \sum\frac{1}{n\log n}, \tag{15}$$

where the summation can start at any positive integral n, since the first
terms do not affect the convergence [118]. We prove the divergence of the
series written, using Cauchy's integral test [122]. We must prove the divergence
of the integral:

$$\int_a^{\infty}\frac{dx}{x\log x} \qquad (a > 1).$$

But we have:

$$\int_a^x\frac{dt}{t\log t} = \int_a^x\frac{d(\log t)}{\log t} = \log(\log x) - \log(\log a),$$

and log (log x) increases indefinitely with increasing x, i.e. the integral
written above is divergent; hence, series (15) is also divergent. We now form
the difference $a_n(u_n/u_{n+1}) - a_{n+1}$, using (14):

$$a_n\frac{u_n}{u_{n+1}} - a_{n+1} = n\log n\left(1 + \frac{1}{n} + \frac{\omega_n}{n^p}\right) - (n+1)\log(n+1) =$$

$$= (n+1)\log n + \frac{\omega_n\log n}{n^{p-1}} - (n+1)\log(n+1) =$$

$$= \omega_n\frac{\log n}{n^{p-1}} + (n+1)\log\left(1 - \frac{1}{n+1}\right). \tag{16}$$

The factor $\omega_n$ is bounded by hypothesis, whilst $(\log n)/n^{p-1}$ tends to zero as
n → ∞, since p - 1 > 0 by hypothesis and log n increases more slowly
than any positive power of n (example 2 of [66]). If we put 1/(n + 1) = -x,
x → 0, and the second term on the right becomes

$$-\frac{1}{x}\log(1+x) = -\frac{\log(1+x)}{x},$$

i.e. it tends to (-1) [38]. We thus see that $\sum 1/a_n$ diverges in this case, and
$a_n(u_n/u_{n+1}) - a_{n+1} \to -1$ as n → ∞, so that we have for sufficiently large
n: $a_n(u_n/u_{n+1}) - a_{n+1} < 0$, i.e. series (9) is divergent [139], which is what we
required to prove.

† Actually, we are concerned with a generalization of the test established
by Gauss.

The convergence tests given above can be used for series with terms of 
any sign if $u_n$ is replaced in them by $|u_n|$. But they only enable us to say
in this case whether the given series is absolutely convergent or not. Generally 
speaking, we can elicit conditions of absolute convergence from them, but not 
conditions of divergence, because we know that series are not necessarily 
divergent when not absolutely convergent [124]. We thus obtain: 

Supplementary note on Gauss's test. The series

$$u_1 + u_2 + \dots + u_n + \dots, \tag{17}$$

where the terms are positive or negative, and where

$$\left|\frac{u_n}{u_{n+1}}\right| = 1 + \frac{\mu}{n} + \frac{\omega_n}{n^p} \tag{18}$$

with p > 1 and $|\omega_n| < A$, is absolutely convergent for μ > 1.

The series is easily proved to be divergent for μ < 0. We have in this
case, taking into account the fact that $\omega_n$ is bounded:

$$n\left(\frac{\mu}{n} + \frac{\omega_n}{n^p}\right) = \mu + \frac{\omega_n}{n^{p-1}} \to \mu \quad \text{for } n\to\infty,$$

and thus, starting from a certain value of n, since μ < 0:

$$\left|\frac{u_n}{u_{n+1}}\right| = 1 + \frac{\mu}{n} + \frac{\omega_n}{n^p} < 1,$$

i.e. starting from this n, the terms of the series increase in absolute values
and the general term $u_n$ cannot tend to zero as n → ∞, i.e. series (17) is
divergent.


141. Hypergeometric series. The above general considerations will be
applied to the so-called hypergeometric or Gaussian series:

$$F(\alpha, \beta, \gamma; x) = 1 + \frac{\alpha\beta}{1!\,\gamma}x + \frac{\alpha(\alpha+1)\,\beta(\beta+1)}{2!\,\gamma(\gamma+1)}x^2 + \dots +$$

$$+ \frac{\alpha(\alpha+1)\dots(\alpha+n-1)\,\beta(\beta+1)\dots(\beta+n-1)}{n!\,\gamma(\gamma+1)\dots(\gamma+n-1)}x^n + \dots \tag{19}$$

Certain functions of applied mathematics lead to such series. The following
particular cases can easily be verified by direct substitution for α, β, γ:

$$F(1, \beta, \beta; x) = 1 + x + x^2 + \dots = \frac{1}{1-x},$$

$$F(-m, \beta, \beta; -x) = (1+x)^m,$$

$$x\,F(1, 1, 2; -x) = \log(1+x). \tag{20}$$

We investigate the convergence of (19) by forming the ratio of two successive
terms:

$$\left|\frac{u_{n+1}}{u_n}\right| = \left|\frac{(\alpha+n)(\beta+n)}{(1+n)(\gamma+n)}\right||x| \to |x| \quad \text{as } n\to\infty, \tag{21}$$

i.e. by the corollary of [121], (19) is convergent for $|x| < 1$, and divergent
for $|x| > 1$. The only remaining cases are: (1) x = 1 and (2) x = -1. We
also note that the factors (α + n), (β + n), (γ + n) are positive for all sufficiently
large n, so that, for x = 1, all the terms of the series have the same
sign for sufficiently large n, whilst we get an alternating series for large
n when x = -1.
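We expand by the progression formula in the first case (taking n sufficiently
large) and cross-multiply term by term the resulting absolutely
convergent series [138]:

$$\frac{u_n}{u_{n+1}} = \frac{(n+1)(\gamma+n)}{(\alpha+n)(\beta+n)} = \frac{\left(1 + \frac{1}{n}\right)\left(1 + \frac{\gamma}{n}\right)}{\left(1 + \frac{\alpha}{n}\right)\left(1 + \frac{\beta}{n}\right)} =$$

$$= \left(1 + \frac{1}{n}\right)\left(1 + \frac{\gamma}{n}\right)\left(1 - \frac{\alpha}{n} + \frac{\alpha^2}{n^2} - \dots\right)\left(1 - \frac{\beta}{n} + \frac{\beta^2}{n^2} - \dots\right) = 1 + \frac{\gamma - \alpha - \beta + 1}{n} + \frac{\omega_n}{n^2},$$

where $\omega_n$ is bounded. Furthermore, if we neglect in this case a sufficiently
large number of initial terms of the series

$$F(\alpha, \beta, \gamma; 1) = 1 + \frac{\alpha\beta}{1!\,\gamma} + \dots + \frac{\alpha(\alpha+1)\dots(\alpha+n-1)\,\beta(\beta+1)\dots(\beta+n-1)}{n!\,\gamma(\gamma+1)\dots(\gamma+n-1)} + \dots,$$

we get a series with terms of constant sign; and on applying Gauss's test
to this, we find absolute convergence if

$$\gamma - \alpha - \beta + 1 > 1, \quad \text{i.e.} \quad \gamma - \alpha - \beta > 0,$$

and divergence if

$$\gamma - \alpha - \beta + 1 \le 1, \quad \text{i.e.} \quad \gamma - \alpha - \beta \le 0.$$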

In the second case, when x = -1, we get an alternating series after a
certain initial term:

$$1 - \frac{\alpha\beta}{1!\,\gamma} + \frac{\alpha(\alpha+1)\,\beta(\beta+1)}{2!\,\gamma(\gamma+1)} - \dots + (-1)^n\frac{\alpha(\alpha+1)\dots(\alpha+n-1)\,\beta(\beta+1)\dots(\beta+n-1)}{n!\,\gamma(\gamma+1)\dots(\gamma+n-1)} + \dots$$

We have here, as earlier:

$$\left|\frac{u_n}{u_{n+1}}\right| = 1 + \frac{\gamma - \alpha - \beta + 1}{n} + \frac{\omega_n}{n^2},$$

and hence, applying the supplementary note on Gauss's test, we find convergence
if

$$\gamma - \alpha - \beta + 1 > 1, \quad \text{i.e.} \quad \gamma - \alpha - \beta > 0,$$

and divergence if

$$\gamma - \alpha - \beta + 1 < 0, \quad \text{i.e.} \quad \gamma - \alpha - \beta < -1.$$

In the case where

$$\gamma - \alpha - \beta = -1,$$

it can be shown that the general term of the series tends to a limit differing
from zero, i.e. the series is divergent [119]. Finally, in the case when

$$-1 < \gamma - \alpha - \beta \le 0,$$

it can be shown that the absolute value of the terms of the series is decreasing
and tends to zero as n → ∞, i.e. [123] the series is convergent, though not
absolutely. We do not propose to dwell on the proof of these last two propositions.

If we apply this to the binomial expansion

$$(1+x)^m = 1 + \frac{m}{1!}x + \frac{m(m-1)}{2!}x^2 + \dots + \frac{m(m-1)\dots(m-n+1)}{n!}x^n + \dots,$$

which is obtained from (19) (β = γ, arbitrary) by writing (-m) in place of α
and (-x) in place of x, and which we know to be convergent for $|x| < 1$
and divergent for $|x| > 1$, we find that the series written is:

absolutely convergent for m > 0, if x = -1,
divergent for m < 0, if x = -1,
absolutely convergent for m > 0, if x = 1,
non-absolutely convergent for -1 < m < 0, if x = 1,
divergent for m ≤ -1, if x = 1,

degenerating to a polynomial for m = integer > 0.

We show later [149] that if a binomial series is convergent for x = ±1,
its sum is $(1 \pm 1)^m$, i.e. $2^m$ or 0 respectively.

It must be noted that we have assumed above that α, β and γ are neither
zero nor negative integers. This is important for γ, since otherwise the terms
of the series become meaningless (the denominator vanishes), whilst if α
or β is zero or a negative integer, the series breaks off and reduces to a finite
sum. 
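
The particular cases and the role of the quantity γ - α - β + 1 can be checked numerically. The sketch below builds the terms of the hypergeometric series from the ratio of successive terms; the function names and the sample parameter values are illustrative assumptions.

```python
import math

def hypergeometric_terms(alpha, beta, gamma, x, count):
    """First `count` terms of the hypergeometric series, via the ratio u_(n+1)/u_n."""
    terms = [1.0]
    for n in range(count - 1):
        ratio = (alpha + n) * (beta + n) / ((1 + n) * (gamma + n)) * x
        terms.append(terms[-1] * ratio)
    return terms

def gauss_mu_estimate(alpha, beta, gamma, n):
    """n * (u_n / u_(n+1) - 1) at x = 1; tends to gamma - alpha - beta + 1."""
    return n * ((n + 1) * (gamma + n) / ((alpha + n) * (beta + n)) - 1)

x = 0.3
geometric = sum(hypergeometric_terms(1, 2.5, 2.5, x, 80))   # compare with 1/(1 - x)
binomial = sum(hypergeometric_terms(-3, 2.5, 2.5, -x, 80))  # compare with (1 + x)^3
logarithm = x * sum(hypergeometric_terms(1, 1, 2, -x, 80))  # compare with log(1 + x)

print(geometric, 1 / (1 - x))
print(binomial, (1 + x) ** 3)
print(logarithm, math.log(1 + x))
print(gauss_mu_estimate(0.5, 1.5, 3.0, 10 ** 6))            # close to 3 - 0.5 - 1.5 + 1 = 2
```

With α = -3 the ratio vanishes from the fourth term on, so the series breaks off and reduces to a finite sum, as noted above.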


142. Double series. We take a rectangular table of numbers, bounded 
above and to the left, but stretching to infinity on the right and downwards: 





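$$\begin{array}{c|cccccc}
 & 1 & 2 & 3 & \dots & n & \dots\\
\hline
1 & u_{11} & u_{12} & u_{13} & \dots & u_{1n} & \dots\\
2 & u_{21} & u_{22} & u_{23} & \dots & u_{2n} & \dots\\
3 & u_{31} & u_{32} & u_{33} & \dots & u_{3n} & \dots\\
\dots & \dots & \dots & \dots & \dots & \dots & \dots\\
m & u_{m1} & u_{m2} & u_{m3} & \dots & u_{mn} & \dots\\
\dots & \dots & \dots & \dots & \dots & \dots & \dots
\end{array} \tag{22}$$

The number of the row in the infinite set of rows is denoted by the first
subscript of u, and similarly, the number of the column in the infinite set
of columns is denoted by the second subscript. Thus, $u_{ik}$ indicates the number
at the intersection of the ith row and the kth column of the table.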

Let us assume, in the first place, that all the $u_{ik}$ are positive.
We define the sum of all the numbers of (22) by marking points with
positive integral coordinates M(i, k) in a coordinate plane and drawing
a series of curves:

$$C_1, C_2, \dots, C_n, \dots$$

intersecting the axes in the first quadrant, and subject only to the conditions
that every point M falls inside the area $(C_n)$, bounded by the curve $C_n$ and
the axes (Fig. 157), for sufficiently large n, and that area $(C_n)$ lies inside
$(C_{n+1})$. We form the sum $S_n$ of all the numbers $u_{ik}$ corresponding to the
points lying inside the area $(C_n)$. This sum evidently increases as n increases,
so that only two cases are possible: either (1) $S_n$ is bounded for all values of
n, in which case there exists a finite limit:

$$\lim_{n\to\infty} S_n = S,$$

or (2) $S_n$ increases indefinitely as n increases.

[Fig. 157 and Fig. 158]

In case (1), we say that the double series

$$\sum_{i,k=1}^{\infty} u_{ik} \tag{23}$$

is convergent to the sum S. In case (2), series (23) is said to be divergent.

The sum of a convergent series (23) with positive terms is independent of
the method of summation, i.e. of the choice of curves $C_n$, and can be obtained by
summation of the series in rows or in columns:

$$S = \sum_{k=1}^{\infty}\left(\sum_{i=1}^{\infty} u_{ik}\right) = \sum_{i=1}^{\infty}\left(\sum_{k=1}^{\infty} u_{ik}\right), \tag{24}$$

i.e. we first find the sums of all the terms in each row (or each column)
of the table, then add the sums obtained.

Suppose, in fact, that we take any other system of curves $C'_1, C'_2, \dots,
C'_n, \dots$, with the same properties as $C_1, C_2, \dots, C_n, \dots$ Let $S'_n$ denote the
sum of all the numbers of the table corresponding to points lying inside the
area $(C'_n)$. Given n, we can always choose a large m such that area $(C'_n)$ appears
inside $(C_m)$, in which case

$$S'_n \le S_m \le S,$$

i.e. there exists a finite limit:

$$\lim_{n\to\infty} S'_n = S' \le S.$$

We can prove similarly, on interchanging the roles of the $C_n$ and $C'_n$, that

$$S \le S',$$

which is only possible with

$$S = S'.$$

The sum of the double series (23) can be obtained even if we take as $C_n$
step-lines formed from segments of straight lines (Fig. 158):

$$i = \text{const.}, \qquad k = \text{const.}$$

This gives us a summation "by squares":

$$S = u_{11} + (u_{12} + u_{22} + u_{21}) + \dots + (u_{1n} + u_{2n} + \dots + u_{nn} + u_{n,n-1} + \dots + u_{n1}) + \dots$$

Summation "by diagonals" gives us:

$$S = u_{11} + (u_{12} + u_{21}) + (u_{13} + u_{22} + u_{31}) + \dots + (u_{1n} + u_{2,n-1} + \dots + u_{n1}) + \dots \tag{25}$$


We prove formula (24) by first of all noting that the sum of any number
of terms of table (22) is less than S, and hence the sum of the terms in any
given row, or in any given column, is also always less than S, whence follows
the convergence of each of the series

$$\sum_{k=1}^{\infty} u_{ik} = s'_i, \qquad \sum_{i=1}^{\infty} u_{ik} = s''_k.$$

We have in addition, for any finite m and n:

$$s'_1 + s'_2 + \dots + s'_m = \sum_{i=1}^{m}\left(\sum_{k=1}^{\infty} u_{ik}\right) \le S,$$

$$s''_1 + s''_2 + \dots + s''_n = \sum_{k=1}^{n}\left(\sum_{i=1}^{\infty} u_{ik}\right) \le S. \tag{26}$$

We shall take, in fact, only the first m rows of table (22). We evidently
have, on taking from these the elements of the first p columns:

$$\sum_{k=1}^{p}\left(\sum_{i=1}^{m} u_{ik}\right) \le S.$$

We have by the rule for addition of series [119]:

$$s'_1 + s'_2 + \dots + s'_m = \sum_{k=1}^{\infty}\left(\sum_{i=1}^{m} u_{ik}\right) = \lim_{p\to\infty}\sum_{k=1}^{p}\left(\sum_{i=1}^{m} u_{ik}\right) \le S,$$





since the expression under the limit sign is not greater than S. 
The proof of the second of inequalities (26) is similar.
Inequalities (26) show that both the series:

$$\sigma' = \sum_{i=1}^{\infty}\Bigl(\sum_{k=1}^{\infty} u_{ik}\Bigr) = \sum_{i=1}^{\infty} s'_i, \qquad \sigma'' = \sum_{k=1}^{\infty}\Bigl(\sum_{i=1}^{\infty} u_{ik}\Bigr) = \sum_{k=1}^{\infty} s''_k$$

are convergent, and have sums not exceeding S, i.e.
$$\sigma' \le S \quad\text{and}\quad \sigma'' \le S.$$

Obviously, on the other hand, for any choice of the system of curves
$C_n$, all the terms that appear in the sum $S_n$ appear in both the sums
$$s'_1 + s'_2 + \ldots + s'_m, \qquad s''_1 + s''_2 + \ldots + s''_m$$
with sufficiently large m, i.e.
$$S_n \le s'_1 + s'_2 + \ldots + s'_m \le \sigma', \qquad S_n \le s''_1 + s''_2 + \ldots + s''_m \le \sigma'',$$
and in the limit, therefore,

$$S = \lim_{n\to\infty} S_n \le \sigma' \quad\text{and}\quad S \le \sigma''.$$

Since $\sigma' \le S$ and $\sigma'' \le S$, we can only have

$$\sigma' = \sigma'' = S,$$
which it was required to prove.
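The independence of the sum from the method of summation can be checked numerically. The following is only a sketch under an assumed table: the entries $u_{ik} = 2^{-i}3^{-k}$ are a hypothetical choice (not taken from the text), for which the double sum equals $1 \cdot \tfrac12 = 0.5$; the function names are likewise illustrative.

```python
def u(i, k):
    # Hypothetical positive table entry; the exact double sum is 1 * (1/2) = 0.5.
    return 2.0 ** (-i) * 3.0 ** (-k)


def sum_by_rows(n):
    # First n rows, each truncated to its first n columns.
    return sum(sum(u(i, k) for k in range(1, n + 1)) for i in range(1, n + 1))


def sum_by_squares(n):
    # Add the border of each square: u_{1m}, ..., u_{mm}, u_{m,m-1}, ..., u_{m1}.
    total = 0.0
    for m in range(1, n + 1):
        total += sum(u(i, m) for i in range(1, m + 1))
        total += sum(u(m, k) for k in range(1, m))
    return total


def sum_by_diagonals(n):
    # Add the diagonals i + k = d for d = 2, ..., n + 1.
    return sum(u(i, d - i) for d in range(2, n + 2) for i in range(1, d))


if __name__ == "__main__":
    for n in (5, 10, 40):
        print(n, sum_by_rows(n), sum_by_squares(n), sum_by_diagonals(n))
```

All three partial sums increase with n and approach the same value, as the proof requires for a series with positive terms.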

As regards double series with terms of any sign (positive and negative),
we only stop to consider absolutely convergent series, i.e. those for which the
double series consisting of absolute values:

$$\sum_{i,k=1}^{\infty} |u_{ik}|$$

is convergent.

We can use an argument similar to that of [124] to show that the sum:

$$S = \lim_{n\to\infty} S_n = \sum_{i=1}^{\infty}\Bigl(\sum_{k=1}^{\infty} u_{ik}\Bigr) = \sum_{k=1}^{\infty}\Bigl(\sum_{i=1}^{\infty} u_{ik}\Bigr) \tag{27}$$

exists for such series, being independent of the method of summation, and
being obtainable, in particular, by summation by rows and by columns.

Remark. Many properties of absolutely convergent simple series can be
extended to absolutely convergent double series; in particular, the remark
of [124] becomes: if the terms of a double series do not exceed in absolute value
the terms of a convergent double series with positive terms, the given series is
absolutely convergent.

Property (2) of [120] can be similarly extended.

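Examples. 1. The series

$$\sum_{i,k=1}^{\infty} \frac{1}{i^{\alpha} k^{\beta}} \tag{28}$$

is convergent for $\alpha > 1$, $\beta > 1$, since summation by squares gives:

$$S_n = \sum_{i=1}^{n}\Bigl(\sum_{k=1}^{n} \frac{1}{i^{\alpha} k^{\beta}}\Bigr) = \Bigl(\sum_{i=1}^{n} \frac{1}{i^{\alpha}}\Bigr)\Bigl(\sum_{k=1}^{n} \frac{1}{k^{\beta}}\Bigr) < AB,$$

where A, B denote the sums of the series

$$\sum_{i=1}^{\infty} \frac{1}{i^{\alpha}}, \qquad \sum_{k=1}^{\infty} \frac{1}{k^{\beta}},$$

which are convergent for $\alpha > 1$, $\beta > 1$ [122].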
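As a numerical illustration of Example 1, the sketch below takes the particular exponents $\alpha = \beta = 2$ (an assumed choice), for which both simple series have the known sum $\pi^2/6$. It checks that the square partial sum factors into a product and stays below AB. The function names are illustrative only.

```python
import math


def square_partial_sum(n, alpha, beta):
    # S_n: sum of 1/(i^alpha k^beta) over the square 1 <= i, k <= n.
    return sum(1.0 / (i ** alpha * k ** beta)
               for i in range(1, n + 1) for k in range(1, n + 1))


def product_of_partial_sums(n, alpha, beta):
    # The same quantity written as a product of two simple partial sums.
    a_n = sum(1.0 / i ** alpha for i in range(1, n + 1))
    b_n = sum(1.0 / k ** beta for k in range(1, n + 1))
    return a_n * b_n


# For alpha = beta = 2 we have A = B = pi^2 / 6.
AB = (math.pi ** 2 / 6.0) ** 2

if __name__ == "__main__":
    for n in (5, 20, 60):
        print(n, square_partial_sum(n, 2, 2), product_of_partial_sums(n, 2, 2), AB)
```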
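2. The series

$$\sum_{i,k=1}^{\infty} \frac{1}{(i+k)^{\alpha}} \tag{29}$$

is convergent for $\alpha > 2$, and divergent for $\alpha \le 2$, since summation by
diagonals gives us:

$$S_n = \frac{1}{2^{\alpha}} + \frac{2}{3^{\alpha}} + \ldots + \frac{n-1}{n^{\alpha}} = \frac{1}{2^{\alpha-1}}\Bigl(1 - \frac12\Bigr) + \frac{1}{3^{\alpha-1}}\Bigl(1 - \frac13\Bigr) + \ldots + \frac{1}{n^{\alpha-1}}\Bigl(1 - \frac1n\Bigr),$$

whence we find, by replacing $(1 - 1/n)$, firstly by the smaller number 1/2,
secondly by the larger number 1:

$$\frac12\Bigl(\frac{1}{2^{\alpha-1}} + \frac{1}{3^{\alpha-1}} + \ldots + \frac{1}{n^{\alpha-1}}\Bigr) \le S_n < \frac{1}{2^{\alpha-1}} + \frac{1}{3^{\alpha-1}} + \ldots + \frac{1}{n^{\alpha-1}}.$$

Our proposition now follows from the convergence of $\sum_{n=1}^{\infty} 1/n^{\alpha-1}$ for
$\alpha > 2$ and its divergence for $\alpha \le 2$.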
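The two-sided bound used in Example 2 can be checked directly. This is a small sketch with hypothetical function names; the exponents tried are arbitrary sample values on both sides of the threshold $\alpha = 2$.

```python
def diagonal_partial_sum(n, alpha):
    # S_n: the diagonal i + k = m contributes (m - 1) equal terms 1/m^alpha.
    return sum((m - 1) / m ** alpha for m in range(2, n + 1))


def comparison_sum(n, alpha):
    # 1/2^(alpha-1) + 1/3^(alpha-1) + ... + 1/n^(alpha-1)
    return sum(1.0 / m ** (alpha - 1) for m in range(2, n + 1))


if __name__ == "__main__":
    for alpha in (1.5, 2.0, 3.0):
        for n in (10, 1000, 100000):
            s = diagonal_partial_sum(n, alpha)
            t = comparison_sum(n, alpha)
            print(alpha, n, 0.5 * t, s, t)
```

For the sample value 3 the printed sums settle down, while for 2 and 1.5 they keep growing with n, in agreement with the stated criterion.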
3. If a and c are positive, and $b^2 - ac < 0$, the series

$$\sum_{i,k=1}^{\infty} \frac{1}{(ai^2 + 2bik + ck^2)^p} \tag{30}$$

is convergent for $p > 1$ and divergent for $p \le 1$.
We first take $b \ge 0$. Let $A_1$ denote the least of the numbers a and c,
and $A_2$ the greatest of a, b, c; since obviously,
$$i^2 + k^2 \ge 2ik,$$
we have:
$$2A_1 ik \le ai^2 + 2bik + ck^2 \le A_2 (i + k)^2,$$
whence we find, on confining our interest to the case $p > 0$:

$$\frac{1}{A_2^p}\,\frac{1}{(i+k)^{2p}} \le \frac{1}{(ai^2 + 2bik + ck^2)^p} \le \frac{1}{(2A_1)^p}\,\frac{1}{i^p k^p};$$

noting that the factors $1/A_2^p$ and $1/(2A_1)^p$ are independent of i and k, this gives
us convergence for $p > 1$ and divergence for $p \le 1$, by Examples 1 and 2
and the remark made above.
Now take $b < 0$. Letting $A_2$ denote the greatest of a, c, |b|, we have
by the obvious inequality $(\sqrt{a}\,i)^2 + (\sqrt{c}\,k)^2 \ge 2\sqrt{ac}\,ik$:

$$2(b + \sqrt{ac})\,ik \le ai^2 + 2bik + ck^2 \le A_2 (i + k)^2,$$

where $b + \sqrt{ac} > 0$, since $|b| < \sqrt{ac}$ by hypothesis. The rest of the proof
follows the same lines as for $b \ge 0$.


143. Series with variable terms. Uniformly convergent series. Taylor's and
Maclaurin's formulae are examples of series, the terms of which depend on
a variable x. In the second part of this course, we shall become familiar with
the extremely important trigonometric series of the form:

$$\sum_{n=1}^{\infty} (a_n \cos nx + b_n \sin nx),$$

the terms of which also depend on a variable x, as well as on n.

We now give a general discussion of series with terms that depend on an
independent variable x.

Suppose we have an infinite sequence of functions:

$$u_1(x),\ u_2(x),\ \ldots,\ u_n(x),\ \ldots, \tag{31}$$

defined in an interval (a, b). If the infinite series

$$u_1(x) + u_2(x) + \ldots + u_n(x) + \ldots \tag{32}$$

is convergent for all values of x in this interval, i.e. for $a \le x \le b$, we say
that it is convergent in the interval (a, b).

The sum of the first n terms of series (32) is obviously a function of x,
and similarly for the sum of the complete series and its remainder; we denote
these respectively by

$$s_n(x), \quad s(x), \quad r_n(x),$$
so that

$$s(x) = \lim_{n\to\infty} s_n(x), \qquad r_n(x) = s(x) - s_n(x). \tag{33}$$

If (32) is convergent in (a, b) and has the sum s(x), this means that, for
any given x of (a, b), given any positive number $\varepsilon$, a number N can be found,
such that

$$|r_n(x)| < \varepsilon \quad\text{for all } n > N,$$

with N obviously dependent on the choice of $\varepsilon$. But it must be noted that
N will in general also depend on the x chosen, i.e. it can have different values
for a single assigned $\varepsilon$ and different values of x from (a, b); we indicate this
by writing N(x). If for any given positive $\varepsilon$ an N can be found, independent
of x, such that for any x of (a, b) we have the inequality

$$|r_n(x)| < \varepsilon \tag{34}$$

for all n > N, series (32) is said to be uniformly convergent in the interval
(a, b).
We take as an example the series

$$\frac{1}{x+1} - \frac{1}{(x+1)(x+2)} - \frac{1}{(x+2)(x+3)} - \ldots - \frac{1}{(x+n-1)(x+n)} - \ldots, \tag{35}$$

where x varies in the interval (0, a) and a is any given positive number.
It can easily be seen that the series can be written as:

$$\frac{1}{x+1} - \Bigl(\frac{1}{x+1} - \frac{1}{x+2}\Bigr) - \Bigl(\frac{1}{x+2} - \frac{1}{x+3}\Bigr) - \ldots - \Bigl(\frac{1}{x+n-1} - \frac{1}{x+n}\Bigr) - \ldots,$$

so that in this case

$$s_n(x) = \frac{1}{x+n}, \qquad s(x) = \lim_{n\to\infty} s_n(x) = 0, \qquad r_n(x) = -\frac{1}{x+n},$$

and if we want to make

$$|r_n(x)| = \frac{1}{x+n} < \varepsilon, \tag{36}$$

it is sufficient to take

$$n > \frac{1}{\varepsilon} - x = N(x). \tag{37}$$

If we now want to satisfy (36) for all x of (0, a), given that n > N where
N is independent of the x taken, we need only put $N = 1/\varepsilon \ge N(x)$, since
(37), and therefore (36), will then be satisfied for n > N for all x of (0, a).
Series (35) is thus uniformly convergent in the interval (0, a).
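The choice $N = 1/\varepsilon$ can be illustrated numerically. The sketch below samples the interval (0, a) on a grid; the grid, the value a = 5 and the function names are assumptions made for illustration only.

```python
def remainder_abs(n, x):
    # |r_n(x)| = 1/(x + n) for series (35).
    return 1.0 / (x + n)


def worst_remainder(n, a, samples=1001):
    # Largest |r_n(x)| over an equally spaced grid on [0, a].
    xs = [a * j / (samples - 1) for j in range(samples)]
    return max(remainder_abs(n, x) for x in xs)


if __name__ == "__main__":
    eps = 0.01
    N = 1.0 / eps  # independent of x
    for n in (int(N) + 1, int(N) + 50, 10 * int(N)):
        print(n, worst_remainder(n, a=5.0), "< eps:", worst_remainder(n, a=5.0) < eps)
```

The worst case occurs at x = 0, where the remainder equals 1/n, so a single N serves the whole interval.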

Not every series has the property of uniform convergence, since it is 
not possible to find an N independent of x for every series, such that it is 
not less than all N(x) in interval (a, b). 

Take, for instance, the series

$$x + x(x - 1) + x^2(x - 1) + \ldots + x^{n-1}(x - 1) + \ldots \tag{38}$$

in the interval $0 \le x \le 1$.
The sum of its first n terms is:

$$s_n(x) = x + (x^2 - x) + (x^3 - x^2) + \ldots + (x^n - x^{n-1}),$$
i.e.
$$s_n(x) = x^n,$$
and hence [26]:

$$s(x) = \lim_{n\to\infty} s_n(x) = 0 \quad\text{for } 0 \le x < 1$$
and
$$r_n(x) = s(x) - s_n(x) = -x^n \quad\text{for } 0 \le x < 1.$$
For x = 1, substitution of x = 1 in (38) gives us the series

$$1 + 0 + 0 + \ldots,$$
i.e.
$$s_n(x) = 1, \qquad s(x) = \lim_{n\to\infty} s_n(x) = 1, \qquad r_n(x) = s(x) - s_n(x) = 0,$$
for x = 1 and any n. Series (38) is convergent throughout the interval
$0 \le x \le 1$, but its convergence is not uniform in the interval. In fact, since
$r_n(x) = -x^n$ for $0 < x < 1$, if we want to satisfy inequality (34), $|r_n(x)| < \varepsilon$,
we must have

$$x^n < \varepsilon, \quad\text{i.e.}\quad n \log x < \log \varepsilon,$$
or, dividing by the negative number log x:

$$n > \frac{\log \varepsilon}{\log x}.$$

In this case, therefore, $N(x) = \log\varepsilon/\log x$ and cannot be replaced by a
smaller number.

As x approaches unity, $\log x \to 0$, and the function N(x) increases indefinitely;
so that it is impossible to find an N, such that inequality (34) is satisfied
for n > N in the whole of (0,1). A consequence of this fact is that, whilst
(38) is convergent throughout (0,1), including x = 1, its convergence becomes
slower as x approaches unity; the nearer x is to unity, the more
terms of the series have to be taken in order to approach its sum. Yet it is
to be noted that the series simply breaks off at the second term when, in
fact, x = 1.
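The growth of N(x) near x = 1 is easy to tabulate. The following sketch assumes a tolerance of $10^{-3}$ and a few sample points, both chosen only for illustration; the function name is hypothetical.

```python
import math


def n_required(x, eps):
    # N(x) = log(eps) / log(x), valid for 0 < x < 1 and 0 < eps < 1.
    return math.log(eps) / math.log(x)


if __name__ == "__main__":
    eps = 1e-3
    for x in (0.5, 0.9, 0.99, 0.999, 0.9999):
        N = n_required(x, eps)
        n = math.floor(N) + 1  # smallest integer exceeding N(x)
        print(x, N, x ** n < eps)
```

Each step closer to unity multiplies the number of terms needed by roughly ten, so no single N can serve the whole interval.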

We now give a second definition of uniform convergence, equivalent to
the first. We formulated above [125] a necessary and sufficient condition
for the convergence of a series. This now becomes: a necessary and sufficient
condition for the convergence of series (32) in the interval (a, b) is that, for
any given positive $\varepsilon$ and for any x of (a, b), there exists an N, such that

$$|u_{n+1}(x) + u_{n+2}(x) + \ldots + u_{n+p}(x)| < \varepsilon \tag{39}$$

for n > N and any positive integer p. For a given $\varepsilon$, this N can still depend
on the choice of x. If for any given positive $\varepsilon$ there exists the same N for all
x of (a, b), such that (39) is satisfied for n > N and any positive integer p,
series (32) is said to be uniformly convergent in the interval (a, b).

We have to show that this new definition of uniform convergence is equivalent
to the first, i.e. if a series is uniformly convergent in the first sense,
it is uniformly convergent in the new sense, and vice versa. To start with,
let the series be uniformly convergent in the first sense, i.e. $|r_n(x)| < \varepsilon$
for n > N, where x is any x of (a, b) and N does not depend on x. We evidently
have:

$$u_{n+1}(x) + u_{n+2}(x) + \ldots + u_{n+p}(x) = r_n(x) - r_{n+p}(x) \tag{40}$$

and therefore:

$$|u_{n+1}(x) + u_{n+2}(x) + \ldots + u_{n+p}(x)| \le |r_n(x)| + |r_{n+p}(x)|,$$
which gives, for n > N, and so n + p > N:
$$|u_{n+1}(x) + u_{n+2}(x) + \ldots + u_{n+p}(x)| < 2\varepsilon. \tag{41}$$

In view of the arbitrary choice of $\varepsilon$, we see that the series is uniformly
convergent in the new sense. We now start from the assumption that the
series is uniformly convergent in the new sense, i.e. that (39) is satisfied
for n > N, where N is independent of x, for any positive integer p and for
any x of (a, b). Noting that

$$r_n(x) = u_{n+1}(x) + u_{n+2}(x) + \ldots = \lim_{p\to\infty}\,[u_{n+1}(x) + \ldots + u_{n+p}(x)],$$
we obtain in the limit, as $p \to \infty$ in (39):
$$|r_n(x)| \le \varepsilon$$
for n > N, i.e. since $\varepsilon$ is arbitrary, the first definition of uniform convergence
follows from the new definition. The equivalence of the two definitions is
thus proved.

We remark that $r_n(x)$ was used in the first definition (34) of uniform convergence
and, at the same time, the additional assumption was made that
the series was convergent; whereas the fact of the convergence of the series
is contained in the new definition of uniform convergence, cf. (39).

144. Uniformly convergent sequences of functions. The sequence of functions:
$$s_1(x),\ s_2(x),\ \ldots,\ s_n(x),\ \ldots, \tag{42}$$

which we considered above, was defined with the aid of series (32); $s_n(x)$
denoted the sum of the first n terms of the series. But sequence (42) can be
taken as given and can be considered on its own account, and a series constructed
from it, the sum of the first n terms of which is the nth term $s_n(x)$ of
the sequence. The terms of this series are obviously defined by:

$$u_1(x) = s_1(x), \quad u_2(x) = s_2(x) - s_1(x), \quad \ldots, \quad u_n(x) = s_n(x) - s_{n-1}(x), \quad \ldots \tag{43}$$

Sequence (42) is often simpler than series (43), as was the case in the examples
considered.

We arrive in this way at the concepts of convergence and uniform convergence
of sequences of functions:

If a sequence of functions (42):

$$s_1(x),\ s_2(x),\ \ldots,\ s_n(x),\ \ldots,$$
is defined in an interval (a, b), and if for any x of this interval the limit exists:
$$s(x) = \lim_{n\to\infty} s_n(x), \tag{44}$$
sequence (42) is said to be convergent in the interval (a, b), whilst s(x) is called
the limit function of sequence (42).

If, in addition, for any previously assigned positive $\varepsilon$ there exists a number N,
independent of x, such that the inequality

$$|s(x) - s_n(x)| < \varepsilon \tag{45}$$
is valid for every n > N throughout (a, b), sequence (42) is said to be uniformly
convergent in the interval (a, b). Condition (45) may be replaced by the equivalent

$$|s_m(x) - s_n(x)| < \varepsilon \tag{46}$$
for m and n > N.
The condition for uniform convergence of sequence (42) is equivalent to the
condition for uniform convergence of the series

$$u_1(x) + u_2(x) + \ldots + u_n(x) + \ldots, \tag{47}$$
where (43)

$$u_1(x) = s_1(x), \quad u_2(x) = s_2(x) - s_1(x), \quad \ldots, \quad u_n(x) = s_n(x) - s_{n-1}(x), \quad \ldots$$

The proof of the equivalence of conditions (45) and (46) in the study of
the uniform convergence of sequences is similar to that of the equivalence
of conditions (34) and (39) for infinite series. We also note that the uniform
convergence of $s_n(x)$ in any part of (a, b) follows at once from its uniform
convergence in (a, b).

The uniform convergence of sequences can be interpreted geometrically.
If s(x) and $s_n(x)$ are illustrated graphically for different values of n, uniform
convergence of the sequence implies that the greatest piece of ordinate comprised
between the curves $y = s_n(x)$ and $y = s(x)$, taken over all x of (a, b),
must tend to zero as $n \to \infty$; this condition is not
fulfilled with non-uniform convergence of the sequence.

[Fig. 159: curves $y = s_n(x) = 1/(x+n)$ and the limit $y = s(x) = 0$.]
[Fig. 160: curves $y = s_n(x) = x^n$ and the limit $y = s(x) = 0$ $(x < 1)$.]

The situation can be seen by inspection of Figs. 159 and 160, drawn for
the examples that were discussed above:

$$s_n(x) = \frac{1}{x+n}, \qquad s_n(x) = x^n.\ \dagger$$

† Different scales are used in Figs. 159 and 160 for x and y, for greater
clarity.
In the case of Fig. 160, the limit function s(x) is represented graphically
by the segment (0,1) of axis OX, excluding the point 1, and by the isolated
point with coordinates (1,1).

Admittedly, the limit function s(x) in the last example is not continuous;
but an example can easily be found of a sequence which, whilst having a continuous
limit function, is nevertheless non-uniformly convergent.

The sequence (Fig. 161):

$$s_n(x) = \frac{nx}{1 + n^2x^2} \qquad (0 \le x \le a) \tag{48}$$

has this property.

[Fig. 161: curves $y = s_n(x) = nx/(1 + n^2x^2)$ for several values of n.]

We evidently have, for $x \ne 0$:

$$\frac{nx}{1 + n^2x^2} = \frac{1}{n}\cdot\frac{x}{\dfrac{1}{n^2} + x^2},$$

and as $n \to \infty$, the first factor on the right, $1/n \to 0$, whilst the second tends
to 1/x, i.e. $s_n(x) \to 0$ for $x \ne 0$. Obviously, for x = 0, $s_n(0) = 0$ for all n
and we therefore have, for all x of (0, a), where a is a given positive number:

$$s(x) = \lim_{n\to\infty} s_n(x) = 0.$$

But the maximum piece of ordinate between curves $s_n(x)$ and s(x), which
simply reduces in the present case to the ordinate of $s_n(x)$, since s(x) = 0,
is 1/2 (corresponding to x = 1/n). Since it does not tend to zero as $n \to \infty$,
sequence (48) is not uniformly convergent in (0, a). In fact, if we want to
have:

$$|s(x) - s_n(x)| = \frac{nx}{1 + n^2x^2} < \varepsilon,$$
solving the second degree inequality:

$$0 < 1 - \frac{x}{\varepsilon}\,n + x^2n^2$$
for n, and taking $\varepsilon$ sufficiently small, we get:

$$n > \frac{1}{2\varepsilon x}\bigl(1 + \sqrt{1 - 4\varepsilon^2}\bigr) = N(x).$$

This function increases indefinitely as $x \to 0$, which accounts for the
non-uniform convergence of the sequence.

We remark finally that Figs. 160 and 161 also show that the sequence
$x^n$ is uniformly convergent in an interval (0, q), where q is any positive
number less than unity, whilst the sequence $nx/(1 + n^2x^2)$ is uniformly convergent
in (q, a), where 0 < q < a; a direct proof of these statements
may also easily be given.
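Both observations about sequence (48) can be checked on a grid: the greatest ordinate stays at 1/2 however large n is, while on an interval (q, a) bounded away from zero the ordinates shrink with n. The sketch below uses the assumed values q = 0.1 and a = 2 and hypothetical function names.

```python
def s_n(n, x):
    # Term of sequence (48).
    return n * x / (1.0 + n * n * x * x)


def sup_on_grid(n, lo, hi, samples=2001):
    # Largest ordinate over an equally spaced grid on [lo, hi].
    xs = [lo + (hi - lo) * j / (samples - 1) for j in range(samples)]
    return max(s_n(n, x) for x in xs)


if __name__ == "__main__":
    for n in (1, 10, 100, 1000):
        # Ordinate at x = 1/n, and the largest ordinate on [0.1, 2].
        print(n, s_n(n, 1.0 / n), sup_on_grid(n, 0.1, 2.0))
```

On (q, a) the ordinates do not exceed 1/(nq), since $nx/(1 + n^2x^2) \le 1/(nx)$, which is the direct proof alluded to for the second sequence.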
145. Properties of uniformly convergent sequences. 1. The limit function
of a sequence of continuous functions, uniformly convergent in an interval
(a, b), is also continuous. Let

$$s_1(x),\ s_2(x),\ \ldots,\ s_n(x),\ \ldots$$

be the given sequence of functions, all of which are continuous in (a, b),
and let
$$s(x) = \lim_{n\to\infty} s_n(x)$$
be the limit function. We have to show that, for any previously assigned
small positive $\varepsilon$, a $\delta$ can be found, such that [35]:

$$|s(x + h) - s(x)| < \varepsilon \quad\text{if}\quad |h| < \delta, \tag{49}$$

assuming that both x and x + h lie in (a, b). We can write for any n:

$$\begin{aligned}
|s(x + h) - s(x)| &= \bigl|[s(x + h) - s_n(x + h)] + [s_n(x + h) - s_n(x)] + [s_n(x) - s(x)]\bigr| \\
&\le |s(x + h) - s_n(x + h)| + |s(x) - s_n(x)| + |s_n(x + h) - s_n(x)|.
\end{aligned}$$

By the definition of uniform convergence, we can select a suitably large
n so that, throughout (a, b), including the points x and x + h:

$$|s(x + h) - s_n(x + h)| < \frac{\varepsilon}{3}, \qquad |s(x) - s_n(x)| < \frac{\varepsilon}{3}.$$

Having fixed such an n, by the continuity of $s_n(x)$ [35], we can find a
$\delta$ such that

$$|s_n(x + h) - s_n(x)| < \frac{\varepsilon}{3} \quad\text{if}\quad |h| < \delta.$$

We combine all these inequalities, to obtain inequality (49).

If the sequence of functions is non-uniformly convergent, the possibility
arises of the limit function not being continuous; the sequence $x^n$ in the
interval (0,1) can be quoted as an instance of this.

The converse does not hold, however: the limit function of a non-uniformly
convergent sequence may be continuous, as, for instance, in the
case of the sequence:

$$\frac{nx}{1 + n^2x^2}.$$
2. If
$$s_1(x),\ s_2(x),\ \ldots,\ s_n(x),\ \ldots$$

is a uniformly convergent sequence of functions, continuous in the interval
(a, b), and $(\alpha, \beta)$ is any interval lying in (a, b),

$$\int_{\alpha}^{\beta} s_n(x)\,dx \to \int_{\alpha}^{\beta} s(x)\,dx \tag{50}$$
or alternatively,

$$\lim_{n\to\infty}\int_{\alpha}^{\beta} s_n(x)\,dx = \int_{\alpha}^{\beta} \lim_{n\to\infty} s_n(x)\,dx. \tag{51}$$

If the limits of integration are variable, e.g. $\beta = x$, the sequence of functions
$$\int_{\alpha}^{x} s_n(t)\,dt \qquad (n = 1, 2, 3, \ldots) \tag{52}$$

is also uniformly convergent in the interval (a, b). (This process is referred
to as passing to the limit under the integral sign.)

We note, first of all, that the limit function s(x) is also continuous, by
property (1). We now take the difference:

$$\int_{\alpha}^{\beta} s(x)\,dx - \int_{\alpha}^{\beta} s_n(x)\,dx = \int_{\alpha}^{\beta} [s(x) - s_n(x)]\,dx.$$

Given $\varepsilon$, in view of the uniform convergence, we can find an N such that,
for all n > N, we have throughout (a, b):
$$|s(x) - s_n(x)| < \varepsilon,$$
and hence [95] (10,): 


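$$\Bigl|\int_{\alpha}^{\beta} [s(x) - s_n(x)]\,dx\Bigr| \le \int_{\alpha}^{\beta} |s(x) - s_n(x)|\,dx < \int_{\alpha}^{\beta} \varepsilon\,dx = \varepsilon(\beta - \alpha) \le \varepsilon(b - a).$$

We therefore have, for any interval $(\alpha, \beta)$ contained in (a, b):

$$\Bigl|\int_{\alpha}^{\beta} s(x)\,dx - \int_{\alpha}^{\beta} s_n(x)\,dx\Bigr| < \varepsilon(b - a) \quad\text{for } n > N.$$

The right-hand side of this inequality is independent of $\alpha$ and $\beta$ and tends
to zero if $\varepsilon \to 0$. In view of the arbitrariness of $\varepsilon$, we can state the result thus:
for any given positive $\varepsilon$, there exists an N, independent of $\alpha$ and $\beta$, such
that

$$\Bigl|\int_{\alpha}^{\beta} s(x)\,dx - \int_{\alpha}^{\beta} s_n(x)\,dx\Bigr| < \varepsilon,$$

for n > N. Formula (50) immediately follows from this. Putting $\beta = x$,
and remembering that N is independent of $\beta$, we see that sequence (52) is
uniformly convergent for all x of (a, b).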

The theorem may fail for a non-uniformly convergent sequence.
For instance, let:

$$s_n(x) = nxe^{-nx^2} \qquad (0 \le x \le 1)$$
(Fig. 162). It can easily be shown, taking the cases x > 0 and x = 0 separately,
that

$$s_n(x) \to 0 \quad\text{as } n \to \infty$$

for all x of (0,1), so that here, s(x) = 0. The sequence cannot be uniformly
convergent, however, since the greatest ordinate of the curve $y = s_n(x)$
or, what is the same thing, the greatest difference $s_n(x) - s(x)$, which is
obtained with $x = 1/\sqrt{2n}$, increases indefinitely as $n \to \infty$.
On the other hand, we have:

$$\int_0^1 s_n(x)\,dx = n\int_0^1 xe^{-nx^2}\,dx = \Bigl[-\tfrac12 e^{-nx^2}\Bigr]_0^1 = \tfrac12\bigl(1 - e^{-n}\bigr) \to \tfrac12,$$

and at the same time:

$$\int_0^1 s(x)\,dx = 0.$$
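The failure of passage to the limit under the integral sign in this example can be seen numerically. The sketch below uses a plain midpoint rule (an assumed, deliberately simple quadrature, not something prescribed by the text) and hypothetical function names.

```python
import math


def s_n(n, x):
    # s_n(x) = n x exp(-n x^2) on 0 <= x <= 1.
    return n * x * math.exp(-n * x * x)


def peak_value(n):
    # Greatest ordinate, attained at x = 1/sqrt(2n).
    return s_n(n, 1.0 / math.sqrt(2.0 * n))


def midpoint_integral(f, lo, hi, steps=20000):
    # Composite midpoint rule; adequate for a smooth integrand on a short interval.
    h = (hi - lo) / steps
    return h * sum(f(lo + (j + 0.5) * h) for j in range(steps))


if __name__ == "__main__":
    for n in (1, 10, 50, 200):
        integral = midpoint_integral(lambda x: s_n(n, x), 0.0, 1.0)
        print(n, peak_value(n), integral, 0.5 * (1.0 - math.exp(-n)))
```

The peak grows like $\sqrt{n/(2e)}$ while the integrals approach 1/2, although the integral of the limit function is zero.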


[Fig. 162: curves $y = s_n(x) = nxe^{-nx^2}$.]

3. If the functions of a sequence:

$$s_1(x),\ s_2(x),\ \ldots,\ s_n(x),\ \ldots$$

have continuous derivatives:

$$s'_1(x),\ s'_2(x),\ \ldots,\ s'_n(x),\ \ldots$$

in an interval (a, b), and if the sequence $s'_n(x)$ is uniformly convergent to the
limit function $\sigma(x)$, and the sequence $s_n(x)$ is convergent to s(x), then $s_n(x)$ is also
uniformly convergent and:

$$\sigma(x) = \frac{ds(x)}{dx}, \tag{53}$$
or alternatively:
$$\lim_{n\to\infty} \frac{ds_n(x)}{dx} = \frac{d}{dx}\lim_{n\to\infty} s_n(x). \tag{54}$$

This process is called passing to the limit under the differentiation sign.
Let x be variable in the interval (a, b) and $\alpha$ be any constant of this interval. We have
by property (2):

$$\lim_{n\to\infty}\int_{\alpha}^{x} s'_n(x)\,dx = \int_{\alpha}^{x} \sigma(x)\,dx.$$
But

$$\int_{\alpha}^{x} s'_n(x)\,dx = s_n(x) - s_n(\alpha) \to s(x) - s(\alpha),$$
and therefore the previous formula gives:

$$s(x) - s(\alpha) = \int_{\alpha}^{x} \sigma(x)\,dx.$$
Differentiating this equation and using a familiar property of definite
integrals (property VII) [95], we have:

$$\frac{ds(x)}{dx} = \sigma(x),$$

which it was required to prove. It remains to prove the uniform convergence
of sequence $s_n(x)$. We have:

$$s_n(x) = s_n(\alpha) + \int_{\alpha}^{x} s'_n(x)\,dx.$$
Sequence $s_n(\alpha)$ is convergent, and it does not contain x. Sequence
$\int_{\alpha}^{x} s'_n(x)\,dx$ is uniformly convergent by property 2. Hence follows the uniform convergence
of $s_n(x)$, since it is a direct result of the definition of uniform convergence
that the sum of two uniformly convergent sequences is also a uniformly
convergent sequence. Of course, every convergent sequence, the terms of
which do not contain x, as for instance, $s_n(\alpha)$, falls under the definition of
uniformly convergent sequence.

We also notice that use has only been made of the uniform convergence
of $s'_n(x)$ and of the convergence of $s_n(\alpha)$ in order to prove the uniform convergence
of $s_n(x)$ throughout (a, b); it is thus sufficient to demand the convergence
of $s_n(x)$ at a single point $x = \alpha$ when formulating the last property.
The uniform convergence of $s_n(x)$ throughout (a, b) follows from this, as we
have already said.


146. Properties of uniformly convergent series. If we take $s_n(x)$ in the
above propositions as the sum of the first n terms of a given series

$$u_1(x) + u_2(x) + \ldots + u_n(x) + \ldots,$$

and s(x) as the sum of the whole series, we at once obtain analogous propositions
for series with variable terms.

1. If the terms of the series
$$u_1(x) + u_2(x) + \ldots + u_n(x) + \ldots \tag{55}$$

are continuous in the interval (a, b), and the series is uniformly convergent,
its sum s(x) is also continuous in (a, b).

2. If the terms of series (55) are continuous functions in the interval (a, b)
and the series is uniformly convergent, it can be integrated term by term between
any desired limits $\alpha$, $\beta$, lying in (a, b), i.e.

$$\int_{\alpha}^{\beta} \sum_{n=1}^{\infty} u_n(x)\,dx = \sum_{n=1}^{\infty} \int_{\alpha}^{\beta} u_n(x)\,dx. \tag{56}$$

If the limits of integration are variable, e.g. $\beta = x$, the series obtained by
term-by-term integration:

$$\int_{\alpha}^{x} u_1(t)\,dt + \int_{\alpha}^{x} u_2(t)\,dt + \ldots + \int_{\alpha}^{x} u_n(t)\,dt + \ldots, \tag{57}$$

is also uniformly convergent in the interval (a, b).

3. If the series (55) is convergent in the interval (a, b) and the derivatives of its
terms, $u'_1(x), u'_2(x), \ldots, u'_n(x), \ldots$, are continuous in (a, b), the series formed
by the derivatives

$$u'_1(x) + u'_2(x) + \ldots + u'_n(x) + \ldots$$

being uniformly convergent in (a, b), the given series (55) is uniformly convergent,
and can be differentiated term by term, i.e.

$$\frac{d}{dx}\sum_{n=1}^{\infty} u_n(x) = \sum_{n=1}^{\infty} \frac{du_n(x)}{dx}. \tag{58}$$

It need only be borne in mind, in order to deduce these propositions from
the theorems of [145], that, as we know already, the properties indicated in
the propositions are valid in the case of a finite number of terms. For instance,
if the terms $u_n(x)$ of the series are continuous, the function

$$s_n(x) = u_1(x) + u_2(x) + \ldots + u_n(x)$$
is continuous for any n [34].


147. Tests for uniform convergence. We state sufficient conditions for
uniform convergence. A series of functions, defined in an interval (a, b):

$$u_1(x) + u_2(x) + \ldots + u_n(x) + \ldots$$

is uniformly convergent in (a, b) if one of the following conditions is satisfied:
(A) A sequence of positive constants can be found,

$$M_1,\ M_2,\ \ldots,\ M_n,\ \ldots$$
such that
$$|u_n(x)| \le M_n \quad\text{in } (a, b) \tag{59}$$
and the series
$$M_1 + M_2 + \ldots + M_n + \ldots \tag{60}$$

is convergent (Weierstrass's test).
(B) The functions $u_n(x)$ can be put in the form:

$$u_n(x) = a_n v_n(x), \tag{61}$$
where $a_1, a_2, \ldots, a_n, \ldots$ are constants, such that the series
$$a_1 + a_2 + \ldots + a_n + \ldots \tag{62}$$

is convergent, and the functions $v_1(x), \ldots, v_n(x), \ldots$ are all non-negative, do not
exceed a positive constant M, and satisfy for all x of (a, b):

$$v_1(x) \ge v_2(x) \ge \ldots \ge v_n(x) \ge \ldots; \qquad 0 \le v_n(x) \le M \tag{63}$$
(Abel's test).
Proof (A). Since series (60) is convergent, given $\varepsilon$, an N can be found
such that, for all n > N and for all p [125]:

$$M_{n+1} + M_{n+2} + \ldots + M_{n+p} < \varepsilon,$$
and using (59):

$$|u_{n+1}(x) + \ldots + u_{n+p}(x)| \le M_{n+1} + \ldots + M_{n+p} < \varepsilon,$$

whence follows [143] the uniform convergence of series (55).
Proof (B). We put:

$$\sigma_p = a_{n+1} + a_{n+2} + \ldots + a_{n+p} \qquad (p = 1, 2, \ldots),$$
which gives us at once:
$$a_{n+1} = \sigma_1, \qquad a_{n+k} = \sigma_k - \sigma_{k-1} \quad (k > 1).$$
We find:
$$u_{n+1}(x) + u_{n+2}(x) + \ldots + u_{n+p}(x) = a_{n+1}v_{n+1}(x) + a_{n+2}v_{n+2}(x) + \ldots + a_{n+p}v_{n+p}(x).$$

We substitute here for $a_{n+k}$ in terms of $\sigma_k$, and collect terms in the same
$\sigma_k$, giving:
$$\begin{aligned}
a_{n+1}v_{n+1}(x) &+ a_{n+2}v_{n+2}(x) + \ldots + a_{n+p}v_{n+p}(x) \\
&= \sigma_1 v_{n+1}(x) + (\sigma_2 - \sigma_1)\,v_{n+2}(x) + \ldots + (\sigma_p - \sigma_{p-1})\,v_{n+p}(x) \\
&= \sigma_1\,[v_{n+1}(x) - v_{n+2}(x)] + \ldots + \sigma_{p-1}\,[v_{n+p-1}(x) - v_{n+p}(x)] + \sigma_p v_{n+p}(x).
\end{aligned}$$

Recalling that $v_{n+p}(x)$ and all the differences $v_{n+k-1}(x) - v_{n+k}(x)$ are
non-negative by hypothesis, we can write:

$$|u_{n+1}(x) + \ldots + u_{n+p}(x)| \le |\sigma_1|\,[v_{n+1}(x) - v_{n+2}(x)] + \ldots + |\sigma_{p-1}|\,[v_{n+p-1}(x) - v_{n+p}(x)] + |\sigma_p|\,v_{n+p}(x),$$
or, letting $\sigma$ denote the greatest of the absolute values $|\sigma_1|, |\sigma_2|, \ldots, |\sigma_p|$,
$$|u_{n+1}(x) + \ldots + u_{n+p}(x)| \le \sigma\,\{[v_{n+1}(x) - v_{n+2}(x)] + \ldots + [v_{n+p-1}(x) - v_{n+p}(x)] + v_{n+p}(x)\},$$
which gives after cancelling:
$$|u_{n+1}(x) + \ldots + u_{n+p}(x)| \le \sigma\,v_{n+1}(x). \tag{64}$$
It follows from the definition of $\sigma_k$ and the convergence of series (62)
that, for any given positive $\varepsilon$, there exists an N such that

$$|\sigma_k| < \frac{\varepsilon}{M}$$

for n > N and any k; and hence:

$$\sigma < \frac{\varepsilon}{M}.$$
Bearing in mind that also, $0 \le v_{n+1}(x) \le M$, by hypothesis, we get
from (64):
$$|u_{n+1}(x) + \ldots + u_{n+p}(x)| < \varepsilon$$
for n > N and any p. Since N does not depend on x, the uniform convergence
of series (55) in (a, b) follows from this.


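Examples.
1. The series

$$\sum_{n=1}^{\infty} \frac{\cos nx}{n^p}, \qquad \sum_{n=1}^{\infty} \frac{\sin nx}{n^p} \qquad (p > 1) \tag{65}$$

are uniformly convergent in any interval, since we have for any x:

$$\Bigl|\frac{\cos nx}{n^p}\Bigr| \le \frac{1}{n^p}, \qquad \Bigl|\frac{\sin nx}{n^p}\Bigr| \le \frac{1}{n^p},$$

and the series $\sum_{n=1}^{\infty} 1/n^p$ is convergent for p > 1 [122] (Weierstrass's test).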
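Weierstrass's test for the cosine series in (65) can be illustrated by comparing a block of terms with the corresponding block of the majorant series. The sketch assumes p = 2 and samples x on a grid over $[0, 2\pi]$; the function names and the sample sizes are illustrative choices.

```python
import math


def majorant_tail(N, p, terms):
    # M_{N+1} + ... + M_{N+terms} with M_n = 1/n^p.
    return sum(1.0 / n ** p for n in range(N + 1, N + terms + 1))


def sampled_block_sup(N, p, terms, samples=100):
    # Largest |u_{N+1}(x) + ... + u_{N+terms}(x)| over sample points in [0, 2*pi],
    # where u_n(x) = cos(nx) / n^p.
    worst = 0.0
    for j in range(samples + 1):
        x = 2.0 * math.pi * j / samples
        block = sum(math.cos(n * x) / n ** p for n in range(N + 1, N + terms + 1))
        worst = max(worst, abs(block))
    return worst


if __name__ == "__main__":
    for N in (5, 20, 80):
        print(N, sampled_block_sup(N, 2, 300), majorant_tail(N, 2, 300), 1.0 / N)
```

The bound does not involve x at all, which is exactly what makes the convergence uniform; for p = 2 the majorant block is itself below 1/N.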


2. If $\sum_{n=1}^{\infty} a_n$ is convergent, the series

$$\sum_{n=1}^{\infty} \frac{a_n}{n^x} \tag{66}$$

is uniformly convergent in the interval $(0 \le x \le l)$ for any l, since, on substituting
here

$$v_n(x) = \frac{1}{n^x},$$
all the conditions of Abel's test are satisfied.
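Inequality (64), on which Abel's test rests, can be checked numerically for series (66). The sketch below assumes the particular coefficients $a_n = (-1)^{n+1}/n$, a convergent but not absolutely convergent choice made only for illustration; all function names are hypothetical.

```python
def a(k):
    # Assumed coefficients: the alternating harmonic series.
    return (-1.0) ** (k + 1) / k


def v(k, x):
    # v_k(x) = 1 / k^x: non-negative, non-increasing in k, at most 1 for x >= 0.
    return k ** (-x)


def block_sum(n, p, x):
    # u_{n+1}(x) + ... + u_{n+p}(x) with u_k(x) = a_k v_k(x).
    return sum(a(n + j) * v(n + j, x) for j in range(1, p + 1))


def sigma_max(n, p):
    # Greatest of |sigma_1|, ..., |sigma_p|, where sigma_j = a_{n+1} + ... + a_{n+j}.
    partial, best = 0.0, 0.0
    for j in range(1, p + 1):
        partial += a(n + j)
        best = max(best, abs(partial))
    return best


if __name__ == "__main__":
    n, p = 50, 200
    for x in (0.0, 0.5, 1.0, 3.0):
        print(x, abs(block_sum(n, p, x)), sigma_max(n, p) * v(n + 1, x))
```

The right-hand column bounds the left-hand one for every x tried, and the factor `sigma_max` does not depend on x.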


148. Power series. Radius of convergence. One of the most important
applications of the above theory of series with variable terms is the power
series, i.e. the series of the form:

$$a_0 + a_1x + a_2x^2 + \ldots + a_nx^n + \ldots, \tag{67}$$

already encountered by us in the discussion of Maclaurin's formula. The
detailed study of the properties of these series belongs to the theory of functions
of a complex variable, so that we only indicate here the most fundamental
properties.

ABEL'S FIRST THEOREM. If the power series (67) is convergent for a certain
value $x = \xi$, it is absolutely convergent for all x satisfying

$$|x| < |\xi|. \tag{68}$$
Conversely, if it is divergent for $x = \xi$, it is also divergent for all x satisfying

$$|x| > |\xi|. \tag{69}$$
Firstly, let the series

$$a_0 + a_1\xi + a_2\xi^2 + \ldots + a_n\xi^n + \ldots$$
be convergent. Since the general term of a convergent series must tend to
zero,

$$a_n\xi^n \to 0 \quad\text{as } n \to \infty,$$

and hence a constant M can be found, such that for all n:
$$|a_n\xi^n| \le M.$$

We now assign to x any value satisfying (68), and put

$$q = \Bigl|\frac{x}{\xi}\Bigr| < 1.$$
We obviously have:
$$|a_nx^n| = |a_n\xi^n| \cdot \Bigl|\frac{x}{\xi}\Bigr|^n \le Mq^n,$$

i.e. the absolute value of the general term of (67), for the x in question, does
not exceed the general term of an infinite decreasing geometrical progression.
Series (67) is therefore absolutely convergent [124].

The second part of the theorem is obvious, since if (67) were convergent
for some x satisfying (69), by what has just been proved, it would have to
be convergent for any $\xi$ satisfying $|\xi| < |x|$, which contradicts what is
given.

COROLLARY. There exists a fully defined number R, called the radius of
convergence of series (67), and having the following properties:

series (67) is absolutely convergent for |x| < R,
series (67) is divergent for |x| > R.

In particular, it may happen that R = 0, in which case series (67) is divergent
for all x differing from zero; or that $R = \infty$, in which case (67) is convergent
for all x.

We dismiss the first case, and consider a positive value $x = \xi$, for which
(67) is convergent. Such a value certainly exists, if there exist in general
values of $x \ne 0$ for which (67) is convergent. If we increase the number
$\xi$, only two cases can arise: either (67) always remains convergent for $x = \xi$,
even when $\xi$ increases indefinitely, in which case we obviously have $R = \infty$;
or else there exists some constant A, with the property that, however closely
$\xi$ approaches A, whilst remaining less than A, series (67) is always convergent,
yet when $\xi$ is greater than A, the series becomes divergent.

The existence of such an A is quite obvious on intuitive-geometric grounds,
since, on the basis of Abel's first theorem, if the series is divergent for any
given value $\xi$, it will also be divergent for all greater values. The existence
of A can be proved rigorously from the theory of irrational numbers. This
number A is obviously the radius of convergence R of series (67).

We give the proof of the existence of R. We divide all real numbers into
two classes as follows: we put in the first class all negative numbers, zero,
and the positive $\xi$ for which (67) is convergent with $|x| = \xi$, and we put
in the second class all the remaining real numbers. By the theorem proved,
any number of the first class is less than any number of the second class,
i.e. we have made a section in the domain of real numbers. There is therefore
either a greatest number in the first class, or a least number in the second
class [40]. It is easily seen that this number is the radius of convergence R
of the series. If every number falls into the first class, we have to take
$R = \infty$.


149. Abel's second theorem. If R is the radius of convergence of series (67),
the series is uniformly, as well as absolutely, convergent in any interval (a, b),
lying wholly within the interval (-R, +R), i.e. for which

$$-R < a < b < R.$$

If the series is also convergent for x = R or for x = -R, it will be uniformly
convergent in (a, R) or (-R, b).

We note, first of all, that we can take R = 1 without loss of generality,
by taking a new independent variable t instead of x, defined by

$$x = Rt.$$

Series (67) now becomes a power series in t, and the interval (-R, +R)
becomes (-1, 1).

If R = 1, it follows from the definition of radius of convergence that
(67) is absolutely convergent for any $x = \xi$, where $|\xi| < 1$. We now take
any interval (a, b) inside (-1, 1), so that

$$-1 < a < b < 1.$$

We choose for $\xi$ any number lying inside (-1, 1), but greater in absolute
value than |a| and |b|. We have for every x in (a, b):

$$|a_nx^n| \le |a_n\xi^n|,$$
and since the series

$$a_0 + a_1\xi + a_2\xi^2 + \ldots + a_n\xi^n + \ldots$$

is absolutely convergent and its terms are independent of x, it follows by
Weierstrass's test that (67) is uniformly convergent in (a, b).
We now suppose that (67) is also convergent for x = 1, i.e. that

$$a_0 + a_1 + a_2 + \ldots + a_n + \ldots$$

is convergent. Putting
$$v_n(x) = x^n,$$

we can apply Abel's test to series (67) in the interval (0, 1), where these functions
are non-negative, do not exceed unity and do not increase with n. If a < 0, the
series is also uniformly convergent in (a, 0) by what has just been proved, so that
(67) is uniformly convergent throughout an interval (a, 1), where a is any number
greater than -1.

The case of (67) converging for x = -1 follows from the above on substituting
(-x) for x.

Let f(x) denote the sum of series (67). It only exists, of course, for the x
for which the series converges. Let R be the radius of convergence of the
series. Bearing in mind the uniform convergence of the series in any interval
(a, b), for which

$$-R < a < b < R, \tag{70}$$
and also property 1 of [146], we can assert that the sum f(x) of the series is
continuous in any of the above intervals (a, b). In other words, f(x) is continuous
inside (-R, +R). We shall also see that this function has derivatives of all orders
inside (-R, +R). If series (67) also converges for x = R, it follows from
the uniform convergence proved for any interval (a, R), where a > -R,
that f(x) is continuous in this interval, and in particular, f(R) is the limit
of f(x) on x tending to R from the left [35]:

$$f(R) = \lim_{x\to R-0} f(x). \tag{71}$$
Similarly for the convergence of the series for x = -R.

We saw above that Newton's binomial expansion [131]:

$$(1 + x)^m = 1 + \frac{m}{1!}\,x + \frac{m(m - 1)}{2!}\,x^2 + \ldots$$

has a radius of convergence R = 1, and converges in certain cases for
$x = \pm 1$. We are able to say, by what has just been proved, that if the series
converges for x = 1, for instance, its sum will then be:

$$\lim_{x\to 1-0} (1 + x)^m = 2^m.$$


150. Differentiation and integration of power series. Let $R$ be the radius of convergence of the series

$$a_0 + a_1 x + a_2 x^2 + \dots + a_n x^n + \dots \qquad (72)$$

We get two further power series by integrating (72) term by term from 0 to $x$, and by differentiating it:

$$a_0 x + \frac{a_1}{2}\,x^2 + \dots + \frac{a_n}{n+1}\,x^{n+1} + \dots \qquad (73)$$

$$a_1 + 2a_2 x + 3a_3 x^2 + \dots + n a_n x^{n-1} + \dots \qquad (74)$$

We show that these have the same radius of convergence $R$. We have to show, for this, that they are convergent if $|x| < R$.

Series (72) is uniformly convergent in any interval $(-R_1, +R_1)$, where $0 < R_1 < R$, by what has been proved; and by property 2 of [146] it can be integrated term by term from 0 to $x$ in this interval, i.e. we can say that series (73) is convergent for any $x$ for which $|x| < R$, and that the sum of (73) is then

$$\int_0^x f(t)\,dt,$$

where $f(x)$ is the sum of series (72). We now show that series (74) is also convergent if $|x| < R$. We take such an $x$, then choose a $\xi$ lying between $|x|$ and $R$, i.e.

$$|x| < \xi < R, \qquad (75)$$

and put

$$q = \frac{|x|}{\xi} < 1.$$
We now have for the terms of series (74):

$$|n a_n x^{n-1}| = n \left|\frac{x}{\xi}\right|^{n-1} \cdot \frac{1}{\xi}\,|a_n \xi^n|,$$

and by the above:

$$|n a_n x^{n-1}| = n q^{n-1} \cdot \frac{1}{\xi}\,|a_n \xi^n|.$$

It is easy to show, by applying d'Alembert's test to the series $\sum n q^{n-1}$, that it is convergent for $0 < q < 1$, and hence [119]:

$$n q^{n-1} \to 0 \quad \text{as } n \to \infty, \qquad (76)$$

so that $n q^{n-1} < \xi$ and therefore, for all sufficiently large $n$:

$$|n a_n x^{n-1}| \le |a_n \xi^n|.$$

But by (75), the series $\sum a_n \xi^n$ is absolutely convergent, and therefore series (74) is absolutely convergent for the $x$ taken. Both series (73) and (74) are thus convergent if $|x| < R$, i.e. the radius of convergence of a power series is not decreased by term by term integration or differentiation. But it immediately follows from this that it can also not increase. Suppose, for instance, that the radius of convergence of series (73) were $R'$, where $R' > R$; then differentiation of series (73) would give us series (72), and its radius of convergence would not be less than $R'$, by what has just been proved; but in fact it is $R$, where $R < R'$. Thus, series (73) and (74) have the same radius of convergence $R$ as series (72). Differentiating (74) a second time gives us a power series

$$2a_2 + 3 \cdot 2\,a_3 x + 4 \cdot 3\,a_4 x^2 + \dots + n(n-1)\,a_n x^{n-2} + \dots$$

with the same radius of convergence $R$, by what has been proved; and so on. All these power series are uniformly convergent in any interval $(a, b)$ for which (70) is true; and the same applies for repeated term by term integration. Recalling properties 2 and 3 of [146], we can finally state the following result:

The power series

$$a_0 + a_1 x + \dots + a_n x^n + \dots, \qquad (77)$$

the radius of convergence of which is equal to $R$, is a continuous function of $x$ inside the interval $(-R, R)$.

The series may be differentiated and integrated term by term any number of times, if $x$ lies inside $(-R, R)$, the power series obtained in this way having the same radius of convergence $R$. The sums of these latter series are equal to the corresponding derivatives and integrals of the sum of series (77).

Putting

$$f(x) = a_0 + a_1 x + a_2 x^2 + \dots + a_n x^n + \dots, \qquad (78)$$

we obtain in this way:

$$f'(x) = a_1 + 2a_2 x + \dots + n a_n x^{n-1} + \dots,$$

$$f''(x) = 2a_2 + 6a_3 x + \dots + n(n-1)\,a_n x^{n-2} + \dots,$$

$$f^{(n)}(x) = n!\,a_n + (n+1)\,n \cdots 3 \cdot 2\,a_{n+1} x + \dots,$$

which give with $x = 0$:

$$a_0 = f(0), \quad a_1 = \frac{f'(0)}{1!}, \quad a_2 = \frac{f''(0)}{2!}, \quad \dots, \quad a_n = \frac{f^{(n)}(0)}{n!}, \quad \dots$$

Substituting these expressions for $a_0, a_1, a_2, \dots, a_n$ in (78), we obtain:

$$f(x) = f(0) + \frac{x f'(0)}{1!} + \frac{x^2 f''(0)}{2!} + \dots + \frac{x^n f^{(n)}(0)}{n!} + \dots \quad (-R < x < +R),$$

i.e. a power series coincides with the expansion of its sum by Maclaurin's formula.

The theory of power series given above is easily extended to power series of the form:

$$a_0 + a_1 (x - a) + a_2 (x - a)^2 + \dots + a_n (x - a)^n + \dots \qquad (79)$$

The role of $x$ is played everywhere by $(x - a)$. The radius of convergence $R$ of series (79) is defined by the condition that the series converges for $|x - a| < R$, and diverges for $|x - a| > R$. If $f(x)$ denotes the sum of (79) in the interval

$$-R < x - a < R, \qquad (80)$$

we obtain the expressions for the coefficients:

$$a_0 = f(a), \quad a_1 = \frac{f'(a)}{1!}, \quad \dots, \quad a_n = \frac{f^{(n)}(a)}{n!}, \quad \dots,$$

i.e. series (79) coincides in the interval (80) with the expansion of its sum into a Taylor series.

We return to the theory of power series in the third volume, when dealing 
with the theory of functions of a complex variable. 

We propose as examples that the expansions of $\log(1 + x)$, $\arctan x$, and $\arcsin x$ be deduced from the theory of power series, noticing that

$$\log(1 + x) = \int_0^x \frac{dx}{1 + x}, \qquad \arctan x = \int_0^x \frac{dx}{1 + x^2}, \qquad \arcsin x = \int_0^x \frac{dx}{\sqrt{1 - x^2}};$$

the domains of application of the expansions obtained should also be investigated.
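
The first two of these expansions can be checked numerically. The sketch below is only an illustration, not part of the original course: it assumes the geometric expansions $1/(1+t) = 1 - t + t^2 - \dots$ and $1/(1+t^2) = 1 - t^2 + t^4 - \dots$, integrates them term by term from 0 to $x$, and compares the partial sums with the library functions. The names `log1p_series` and `arctan_series` and the number of terms are arbitrary choices made here.

```python
import math


def log1p_series(x, terms=200):
    """Partial sum of x - x^2/2 + x^3/3 - ..., obtained by integrating
    1/(1+t) = 1 - t + t^2 - ... term by term from 0 to x (|x| < 1)."""
    total = 0.0
    power = x  # holds x^(n+1)
    for n in range(terms):
        total += (-1) ** n * power / (n + 1)
        power *= x
    return total


def arctan_series(x, terms=200):
    """Partial sum of x - x^3/3 + x^5/5 - ..., obtained by integrating
    1/(1+t^2) = 1 - t^2 + t^4 - ... term by term from 0 to x (|x| < 1)."""
    total = 0.0
    power = x  # holds x^(2n+1)
    for n in range(terms):
        total += (-1) ** n * power / (2 * n + 1)
        power *= x * x
    return total


if __name__ == "__main__":
    for x in (-0.5, 0.3, 0.9):
        print(x, log1p_series(x), math.log(1 + x))
        print(x, arctan_series(x), math.atan(x))
```

Inside the interval of convergence the partial sums agree with `math.log` and `math.atan` to within rounding error; the agreement becomes slower as $|x|$ approaches 1, which is where the end-point investigation asked for above becomes relevant.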
EXERCISES ON CHAPTER IV
Write down the formula for the nth term of each of the series 1-10: 
1 1 1 1 1 1 1 
1. 1+ ztg tyt saa tf 2. strateteyt..: . 
2 3 4 1 1 1 
31H AI A ST 
3 4. 5 6 2 4 6 8 
a A o 
1 1 1 1 1 1 
Toy tg tas 3-9 039 5a: 5000 
1:3 1-3-5 1:3:5:7 
8. ltir tirit rrot e 
9. 1-1+1-1+1-=1>+%.... 
1 1 1 
10. 14 3+3+ +84 7 ++... . 
Examine for convergence the series 11-21 using the comparison test: 


1. 1—14-1 — 1-F ... E (— 193 4 .... 
2 pig eerie 


2 3 4 n+l 
BEATA taper eel 
1 1 1 — pr 
M Ga o +P. 
Vlo Vio y10 


1 1 1 1 
13. HG A boe 
16 Lap oap ous yoo 
So Tar 31 5777 T 10n+1 Utt 
1 1 1 1 
17. —— — L one TUM 
y1:3 + y2-3 + Jy3-4 + + Yn(n +1) 
2 B n 
18. 2-- FA H 
1 1 1 
19. 1 mr eee — eee . 
tatt tat 
1 1 1 1 
20. orb gebe bo 
1. 1 is fa 
2 3 n 
21. — — == —— oo ———- ene 
s tag ta (n + 1) n T 
With the help of d'Alembert's test investigate the convergence of the 
series 22-23: 


1 3 5 2n — 2 
22. atza GG T^g be 
2 25 , 2-5-8 2-5-8... (8n — 1)
23. T tis trest: Ct $$... ün—3) T 
With the help of Cauchy's test investigate the convergence of the 
series 24-25: 
2 3y 4 y n+l 
24. 3 + (3) + (5) 4... ty +... 
l 913 335 n y2n-1 
25. = + (5) + (5) toc (RM) hr. 


5 





Investigate the convergence of the series 26—54: 
1 1 
26. 1437 +3 + ty L+. 


1 1 1 
AE 
1 1 1 1 
28. 4 tug tag te t Gn) Gn te: 
1 4 4 
29. ¿HG 0390 to 


30 yt tto rt 
3L ay tpr tar tota pate 
32 it t e t+. 
1 3 1, 
e sene 
42484374 erm. 
a 








35. Teres net r t 

36. Sarcsinn 2. 37. > sin(n^?). 
n=1 n=l 

38. > log (1 + x) 39. > log (1 + =) 
n=l n=l 

40. > (log n) 41. > (n log n) 
n=l n=1 


œ 


1 


V y-1 -2 
42. Sn” (log n)7?. 43. 2,1108 nTogloga * 


n=1 
eo 


44. > (n? — n. 43. 5 [n (n + my . 


co -i < 
46. 3 PODA 4% > gap 


n=1 n=2 
co 
1 4 nls 
48, =, hain 9, 





lus 


(2n — 1) (nls — 1) * 
! o 2"
50. > E — Cos =). 51. 3. 52. 2 n 





n=1 nal n 
eo eo 
53. 9 n7"n! 54. Denn! 
n=1 n=l 


Investigate the convergence of the series 55-66. In the case of con- 
vergent series state whether the series is absolutely or conditionally 
convergent. 

(are 


eee —...$- +. 
Bero. tet $ 
js zT Vn ` 
(y 
57. 1444 + 
58. Pe ee up A 
s 7 13 6n—5 i 
3 5 1 2n+1 
muc doo Hee neus 
1 2 3 4 " 
60. —;.— 1 eta HH IOP E... 
"DRM A A SE NES 
2/2—-1 * 3/3-1  4l4—1 


oe Vis tl) 
Sepe milega 








62. a+ (r) Go) Hot) + " 
63, e PET... 





1 357... (2n 4-1) 
TCI sg. .. (8n — 1) +. 
l 1.4 1-4-7 
64. 5 T9 t Tyi esc 
n1 1:4:7 ... (8n — 2) 
ae Ga 5-81... On ES) +... 

















65. < sinna . 66. V 1 niogn 
= (log 10)” 2 ( ) n 
Investigate the convergence of the series 67-73 with complex terms: 
67. = nr) o : 2 n(2i — 1)" . 
2 2 : = 3" 
69. > —-, W Sia. ROV 
nay n(3 +i) na A Acl Ig 
1, s x e > A 


— (ntin < [n+ (2n — 1) 532 
74. Construct two series with the property that their sum is convergent while their difference is divergent.


75. Construct the product of the series > n, Y 27"*1, [s this 


n=] n=1 
product convergent ? 


eo 2 
76. Construct the series [= 2 and consider its convergence. 
n=1 


77. Given the series > (— 1)";n! Estimate the error involved in 


n=1 
taking the sum of the series to be given by the sum of the first 
four terms. Estimate it also when the sum of the first five terms 
is taken. What is it possible to say about the signs of these errors ? 


78. Estimate the error in replacing the series > + by the sum 
n=l 
of its first n terms. 
79. Estimate the error in replacing the series > i by the sum of 
n=1 


its first n terms. In particular estimate the error corresponding 
to n= 10. 
Find the interval of convergence of the following series 80-93 and 
investigate the convergence at the end-points of the interval. 
80. Y. 8L oira, 82 Noar2n—1). 
n=1 n=1 


n=1 


18 


83. $ (4n — 322 ARMA, 84, N (— 1) tat, 


n=1 


3 
D 





85. S (n+ 12n +1). 86. N(— 1)" (2n + a7. 
n=0 n=0 

87. San! 8 Salat. 89 Yara, 
n=0 n=1 n=1 

0. w( n gn, 91. $3" gr, 
? —, ( 2n +1 ) K = 
Sey 3. Sal nar, 
92 2 erg) 93. Sal nna 
Determine the circle of convergence for the series 94-100: 








94 Nin 95. N (1+ ni)z". 
=0 =0 
96 pe (z — 2)" 97 B5 ¿an 
° = n3" — = Qn 
98. (1+ 2i) + Tip caes ved
+(1+2)( 3 +24). (n+ 1+ 202 4. 


Pr MS ts is 





ES um (nm) Te 
100. Y m e". 
2 n Fa) 
Prove the uniform convergence of the series 101-103 in the intervals 
stated: 


101. > n~? 2" in the closed interval [—1, 1]. 102. > 2” sin nz on 
n=1 


n=1 


the whole real line. 103. > (— 1)? n3" in the closed in- 
n=1 
terval [0, 1]. 
Applying term-by-term differentiation or integration find the sums 
of the series 104-110: 


104. A eee ee : 





105. 2-242. 4 (19 E. 
a3 25 gent 
106. s+ + ee bey + 
3 2n—1 
107. z— 5- + ES Di Gs El 
108. 1 + 2z + 332 + ... -- (n+ 1) a^ 4... 
109. 1—32? + Sat — ... + (1/1 (2n — 1) g? p... 


110. 1.2 + 2.32 + 3-4 +... a(n p Ia +... 
Find the sums of "a a PO 


111, ++ lA um Pb 
2 E opi 
1 1 1 (—1yr^! 
113. 1—53 + “5-32 ea gus i 
1 3- 5 2n — 1 

1M. Pune bee 
115. Expand the polynomial z? — 223? — 5% — 2 in powers of x + 4. 
116. If $f(x) = 5x^3 - 4x^2 - 3x + 2$, write $f(x + h)$ as a series of powers of $h$.

117. Expand $\log x$ as a series in $x - 1$.
118. Expand $1/x$ as a series in $x - 1$.


n 


+. 
119. Expand $1/x^2$ as a series in $x + 1$.

120. Expand $e^x$ as a series of powers of $x + 2$.

121. Expand | as a series of powers of x — 4. 

122. Expand $\cos x$ as a series of powers of $(x - \pi/2)$.

Expand the functions 123-126 as power series in x, stating the interval 
of convergence: 


8 
123. sin? x cos? x. 124. (1 + x) e *. 125. (1 + e*)?. 126. /8 + x- 


CHAPTER V 


FUNCTIONS OF SEVERAL VARIABLES 


§ 15. Derivatives and differentials 


151. Basic concepts. We began the study of functions of two variables in § 6 of Chapter II by explaining the basic concepts regarding these functions. We shall now discuss functions of several variables, and also deal in more detail with the concept of limit.

We shall assume that the function f(x, y) is defined either in the 
whole plane or in a certain domain. A definite value f(x, y) thus 
corresponds to every point (x, y) of this domain. If interior points only 
of the domain are considered, the domain is said to be open. If the 
domain includes its boundary, it is said to be closed. 

Similarly, if we take a rectangular system of Cartesian coordinates $OX$, $OY$, $OZ$ in space, we are able to talk of a point $M$ of space with coordinates $(x, y, z)$, instead of talking of a set of three numbers $(x, y, z)$. We shall assume that the function $f(x, y, z)$ is defined throughout all space, or in some domain of space that may be either open or closed. The boundaries of the domain (there may be several) will be surfaces in the simplest cases. For instance, the inequalities:

$$a_1 \le x \le a_2; \quad b_1 \le y \le b_2; \quad c_1 \le z \le c_2$$

define a closed rectangular parallelepiped, the edges of which are parallel to the coordinate axes. The inequalities:

$$a_1 < x < a_2; \quad b_1 < y < b_2; \quad c_1 < z < c_2$$

define an open parallelepiped. The inequality:

$$(x - a)^2 + (y - b)^2 + (z - c)^2 \le r^2$$

defines a closed sphere with centre $(a, b, c)$ and radius $r$. If the equality sign is omitted, leaving only the sign $<$, an open sphere is obtained. Limit and continuity are defined for functions of three variables in exactly the same way as in [67] for two variables.
Geometrical terminology is often preserved in the case of functions $f(x_1, x_2, \dots, x_n)$ of several variables, where $n > 3$, although the space can no longer be visualized geometrically. A sequence of $n$ real numbers $(x_1, x_2, \dots, x_n)$ is called a point. The totality of such points forms an $n$-dimensional space. A domain of the space is defined by inequalities. For instance, the inequalities:

$$c_1 \le x_1 \le d_1; \quad c_2 \le x_2 \le d_2; \quad \dots; \quad c_n \le x_n \le d_n$$

define an $n$-dimensional parallelepiped, or, as we sometimes say, an $n$-dimensional interval. The inequality

$$\sum_{k=1}^{n} (x_k - a_k)^2 \le r^2$$

defines an $n$-dimensional sphere. The set of points defined by the last inequality with a certain choice of $r$, or by the inequalities $|x_k - a_k| \le \rho$ $(k = 1, 2, \dots, n)$, where $\rho$ is a certain positive number, is called a neighbourhood of the point $(a_1, a_2, \dots, a_n)$.

If the function $f(x_1, x_2, \dots, x_n)$ is defined at, and in a certain neighbourhood of, the point $M_0(a_1, a_2, \dots, a_n)$, we say that $f(x_1, x_2, \dots, x_n)$ tends to a limit $A$ as the point $M(x_1, x_2, \dots, x_n)$ tends to the point $M_0(a_1, a_2, \dots, a_n)$, and we write:

$$\lim_{x_k \to a_k} f(x_1, x_2, \dots, x_n) = A \quad \text{or} \quad \lim_{M \to M_0} f(x_1, x_2, \dots, x_n) = A,$$

if for any given positive number $\varepsilon$ there exists a positive $\eta$, such that $|A - f(x_1, x_2, \dots, x_n)| < \varepsilon$, provided only that $|a_k - x_k| < \eta$ with $k = 1, 2, \dots, n$, the point $M(x_1, x_2, \dots, x_n)$ being assumed distinct from $M_0(a_1, a_2, \dots, a_n)$. The continuity of $f(x_1, x_2, \dots, x_n)$ at the point $M_0(a_1, a_2, \dots, a_n)$ is defined by the equation:

$$\lim_{x_k \to a_k} f(x_1, x_2, \dots, x_n) = f(a_1, a_2, \dots, a_n).$$
Xk-* OE 

The properties indicated in [67], for functions continuous in a 
closed domain, still hold. 

The continuity of a sum, product and quotient of continuous functions can be justified, as in [34] for functions of a single variable. In the case of a quotient, the denominator is assumed to differ from zero at the point $(a_1, a_2, \dots, a_n)$.
152. Passing to a limit. We dwell in more detail on the concept of limit, confining ourselves to the case of a function of two variables. If the limit exists,

$$\lim_{\substack{x \to a \\ y \to b}} f(x, y) = A, \qquad (1)$$

we shall say that a limit exists with respect to both variables. This means, as we know [67], that $f(x, y)$ tends to a limit $A$ as the point $M(x, y)$ tends to $M_0(a, b)$ in any prescribed manner. In particular:

$$\lim_{x \to a} f(x, b) = A \quad \text{and} \quad \lim_{y \to b} f(a, y) = A. \qquad (2)$$

Point $M(x, y)$ tends to $M_0(a, b)$ along a line parallel to $OX$ in the first case, and along a line parallel to $OY$ in the second case. We remark that the existence of limit (1) does not follow, however, from the existence of limits (2) and their equality. We take as an example the function $f(x, y) = xy/(x^2 + y^2)$, and we take $a = 0$ and $b = 0$. We have:

$$\lim_{x \to 0} f(x, 0) = \lim_{x \to 0} \frac{x \cdot 0}{x^2 + 0} = \lim_{x \to 0} 0 = 0 \quad \text{and} \quad \lim_{y \to 0} f(0, y) = 0,$$

whereas limit (1) does not exist in this case. In fact, putting $y/x = \tan\alpha$, we can re-write our formula as:

$$f(x, y) = \frac{xy}{x^2 + y^2} = \frac{\tan\alpha}{1 + \tan^2\alpha}. \qquad (3)$$

If $M(x, y)$ tends to $M_0(0, 0)$ along a line passing through the origin and making an angle $\alpha_0$ with $OX$, $f(x, y)$, as given by (3), remains constant and its magnitude depends on the choice of $\alpha_0$, whence it follows that limit (1) does not exist in the example considered. We note that formula (3) does not define the function at the point $M_0(0, 0)$ itself.
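
The dependence on the direction of approach can be seen numerically. The following sketch is an illustration only: the function name `value_along_ray` and the sample angles and distances are choices made here, not part of the text. It evaluates $xy/(x^2 + y^2)$ at points approaching the origin along several rays and prints the value predicted by (3).

```python
import math


def f(x, y):
    """The example function xy / (x^2 + y^2), undefined at the origin."""
    return x * y / (x * x + y * y)


def value_along_ray(alpha, r):
    """Value of f at distance r from the origin on the ray at angle alpha."""
    return f(r * math.cos(alpha), r * math.sin(alpha))


if __name__ == "__main__":
    for alpha in (0.0, math.pi / 6, math.pi / 4):
        values = [value_along_ray(alpha, r) for r in (1e-1, 1e-4, 1e-8)]
        predicted = math.tan(alpha) / (1 + math.tan(alpha) ** 2)
        print(alpha, values, predicted)
```

Along each ray the computed values do not change as $r$ decreases, but they differ from ray to ray, in agreement with the conclusion that limit (1) does not exist.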

Instead of the limit (1), we can consider an iterated limit; here, we first pass to the limit with respect to $x$, with $y$ constant and differing from $b$, then pass to the limit with respect to $y$, or vice versa:

$$\lim_{x \to a}\left[\lim_{y \to b} f(x, y)\right] \quad \text{or} \quad \lim_{y \to b}\left[\lim_{x \to a} f(x, y)\right]. \qquad (4)$$

It can happen that both the iterated limits exist but are different. For instance, taking the function

$$f(x, y) = \frac{x^2 - y^2 + x^3 + y^3}{x^2 + y^2},$$

it is easy to show that:

$$\lim_{x \to 0}\left[\lim_{y \to 0} f(x, y)\right] = 1; \qquad \lim_{y \to 0}\left[\lim_{x \to 0} f(x, y)\right] = -1.$$
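
A rough numerical illustration of the two iterated limits follows. It is a sketch under an assumption stated here: the inner limit is imitated by making the inner variable many orders of magnitude smaller than the outer one, which is not a proof but shows the two different values. The name `g` is chosen here to avoid confusion with other examples.

```python
def g(x, y):
    """The example function (x^2 - y^2 + x^3 + y^3) / (x^2 + y^2)."""
    return (x**2 - y**2 + x**3 + y**3) / (x**2 + y**2)


if __name__ == "__main__":
    # y -> 0 first (y much smaller than x), then x -> 0: values approach 1.
    for x in (1e-2, 1e-3, 1e-4):
        print("inner y, outer x:", x, g(x, x * 1e-8))
    # x -> 0 first (x much smaller than y), then y -> 0: values approach -1.
    for y in (1e-2, 1e-3, 1e-4):
        print("inner x, outer y:", y, g(y * 1e-8, y))
```

For fixed $x \ne 0$ the inner limit equals $1 + x$, and for fixed $y \ne 0$ the other inner limit equals $-1 + y$, which is what the printed values reproduce.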


The following theorem holds, however:
THEOREM. If the limit with respect to both variables (1) exists, and if for every $x$, sufficiently near to, but differing from, $a$, there exists the limit

$$\lim_{y \to b} f(x, y) = \varphi(x), \qquad (5)$$

the first iterated limit (4) exists and is equal to $A$, i.e.

$$\lim_{x \to a} \varphi(x) = A. \qquad (6)$$

It follows from the existence of limit (1) [67], that for any given positive $\varepsilon$ there exists a positive $\eta$, such that

$$|A - f(x, y)| < \varepsilon \quad \text{for } |x - a| < \eta \text{ and } |y - b| < \eta, \qquad (7)$$

$(x, y)$ being assumed not to coincide with $(a, b)$. We fix $x$, different from $a$, so that $|x - a| < \eta$. Taking (5) into account, and passing to the limit $y \to b$ in inequality (7), we get:

$$|A - \varphi(x)| \le \varepsilon \quad \text{for } |x - a| < \eta \text{ and } x \ne a,$$

from which, in view of the arbitrariness of $\varepsilon$, equation (6) follows.


Note. Similarly, if we presuppose the existence of limit (1) and that, for every $y$ sufficiently near, and differing from, $b$, the limit exists:

$$\lim_{x \to a} f(x, y) = \psi(y),$$

the second iterated limit (4) exists and is equal to $A$, i.e.

$$\lim_{y \to b} \psi(y) = A.$$


If limit (1) exists and is equal to $f(a, b)$, i.e. $A = f(a, b)$, $f(x, y)$ is continuous at the point $(a, b)$, or, as we say, is continuous with respect to both variables at $(a, b)$. Here, by (2):

$$\lim_{x \to a} f(x, b) = f(a, b); \qquad \lim_{y \to b} f(a, y) = f(a, b),$$

i.e. the function is continuous with respect to each individual variable at $(a, b)$; this was discussed earlier [67]. Continuity with respect to both variables does not follow, on the other hand, from continuity with respect to each variable. Suppose, in fact, that a function is defined by (3) except at the origin, and that we put $f(0, 0) = 0$. We have here, as mentioned above:

$$\lim_{x \to 0} f(x, 0) = 0 \quad \text{and} \quad \lim_{y \to 0} f(0, y) = 0,$$

i.e. the function is continuous with respect to each variable at $(0, 0)$. But it is not continuous with respect to both variables, since, as we have seen, $f(x, y)$ has no definite limit as $M(x, y)$ tends to $M_0(0, 0)$.


If $f(x, y)$ has partial derivatives in a certain domain containing $(x, y)$ as an interior point, the formula is valid, as we have shown [68]:

$$f(x + \Delta x, y + \Delta y) - f(x, y) = f'_x(x + \theta \Delta x, y + \Delta y)\,\Delta x + f'_y(x, y + \theta_1 \Delta y)\,\Delta y \quad (0 < \theta, \theta_1 < 1).$$

We assume that the partial derivatives are bounded in the domain concerned, i.e. their absolute values do not exceed some number $M$. The formula written gives in this case:

$$|f(x + \Delta x, y + \Delta y) - f(x, y)| \le M\,(|\Delta x| + |\Delta y|),$$

and the right-hand side of this inequality tends to zero as $\Delta x \to 0$ and $\Delta y \to 0$, whence it follows:

$$\lim_{\substack{\Delta x \to 0 \\ \Delta y \to 0}} f(x + \Delta x, y + \Delta y) = f(x, y),$$

i.e. if $f(x, y)$ has bounded partial derivatives inside a certain domain, it is continuous in this domain.
Given the additional condition $f(0, 0) = 0$, function (3) is zero on the whole of $OX$ and the whole of $OY$, and it evidently has partial derivatives equal to zero at $M_0(0, 0)$. It also has partial derivatives at the remaining points:

$$f'_x(x, y) = \frac{y^3 - x^2 y}{(x^2 + y^2)^2}, \qquad f'_y(x, y) = \frac{x^3 - x y^2}{(x^2 + y^2)^2},$$

i.e. the function concerned has partial derivatives over all the plane. In spite of this, it does not possess continuity at the point $(0, 0)$, as we have seen. This is explained by the fact that the partial derivatives can take values as large as desired in absolute magnitude as the points $(x, y)$ approach the origin.

153. Partial derivatives, and total differential of the first order. 
The concepts of the partial derivatives and the total differential of a 
function of two variables were introduced in [68]. These concepts can 
be extended to the case of functions of any number of variables. We 
take a function of four variables as an example: 


w = f(x, y, z, t). 


The partial derivative of this function with respect to $x$ is defined as the limit

$$\lim_{h \to 0} \frac{f(x + h, y, z, t) - f(x, y, z, t)}{h},$$

if it exists; it is denoted by one of the symbols:

$$f'_x(x, y, z, t), \quad \text{or} \quad \frac{\partial f(x, y, z, t)}{\partial x}, \quad \text{or} \quad \frac{\partial w}{\partial x}.$$

The partial derivatives with respect to the other variables are similarly defined.
The total differential of a function is defined as the sum of its partial differentials:

$$dw = \frac{\partial w}{\partial x}\,dx + \frac{\partial w}{\partial y}\,dy + \frac{\partial w}{\partial z}\,dz + \frac{\partial w}{\partial t}\,dt,$$

where $dx$, $dy$, $dz$, $dt$ are the differentials of the independent variables (arbitrary magnitudes, independent of $x$, $y$, $z$, $t$).
The differential is the principal part of the increment of the function:

$$\Delta w = f(x + dx, y + dy, z + dz, t + dt) - f(x, y, z, t),$$

and in fact (cf. [68]):

$$\Delta w = dw + \varepsilon_1\,dx + \varepsilon_2\,dy + \varepsilon_3\,dz + \varepsilon_4\,dt,$$

where $\varepsilon_1, \varepsilon_2, \varepsilon_3, \varepsilon_4$ tend to zero if $dx$, $dy$, $dz$, $dt$ tend to zero, $w$ being assumed to have continuous partial derivatives within a certain domain containing $(x, y, z, t)$ as an interior point.

The rule for differentiating functions of a function can similarly be generalized. Suppose, for instance, that $x$, $y$, $z$ are not independent variables, but functions of the independent variable $t$. In this case, $w$ will depend on $t$ both directly, and via $x$, $y$, $z$, and the total derivative of $w$ with respect to $t$ will have the form:

$$\frac{dw}{dt} = \frac{\partial w}{\partial t} + \frac{\partial w}{\partial x}\frac{dx}{dt} + \frac{\partial w}{\partial y}\frac{dy}{dt} + \frac{\partial w}{\partial z}\frac{dz}{dt}. \qquad (8)$$
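
As a hedged illustration of formula (8), the sketch below takes a sample function $w = xy + zt^2$ with $x = \cos t$, $y = \sin t$, $z = t^2$ (all chosen here for the example, not taken from the text) and compares the right-hand side of (8) with a central-difference derivative of $w$ regarded as a function of $t$ alone.

```python
import math


def w(x, y, z, t):
    """Sample function of four variables (an assumption of this example)."""
    return x * y + z * t**2


def w_of_t(t):
    """w with x = cos t, y = sin t, z = t^2 substituted."""
    return w(math.cos(t), math.sin(t), t**2, t)


def total_derivative(t):
    """Right-hand side of (8) for the sample function."""
    x, y, z = math.cos(t), math.sin(t), t**2
    dx_dt, dy_dt, dz_dt = -math.sin(t), math.cos(t), 2 * t
    dw_dt_direct = 2 * z * t          # partial derivative in t with x, y, z fixed
    dw_dx, dw_dy, dw_dz = y, x, t**2  # partial derivatives in x, y, z
    return dw_dt_direct + dw_dx * dx_dt + dw_dy * dy_dt + dw_dz * dz_dt


if __name__ == "__main__":
    t, step = 0.7, 1e-6
    numeric = (w_of_t(t + step) - w_of_t(t - step)) / (2 * step)
    print(total_derivative(t), numeric)
```

Here $w = \cos t \sin t + t^4$ after substitution, so both values should be close to $\cos 2t + 4t^3$.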


We do not dwell on the proof of this rule, since it consists of a word for word repetition of that discussed in [69]. If $x$, $y$, $z$ depend on other independent variables besides $t$, we must write the partial derivatives $\partial x/\partial t$, $\partial y/\partial t$, $\partial z/\partial t$, instead of $dx/dt$, $dy/dt$, $dz/dt$ on the right-hand side of (8). Function $w$ will depend in this case on the other independent variables as well as on $t$, and we must also replace $dw/dt$ by $\partial w/\partial t$ on the left-hand side of (8). But this latter partial derivative differs from the partial derivative $\partial w/\partial t$ appearing on the right-hand side of (8), the evaluation of which only takes account of the direct dependence of $w$ on $t$; brackets are sometimes used to distinguish this latter partial derivative, so that equation (8) takes the form here:

$$\frac{\partial w}{\partial t} = \left(\frac{\partial w}{\partial t}\right) + \frac{\partial w}{\partial x}\frac{\partial x}{\partial t} + \frac{\partial w}{\partial y}\frac{\partial y}{\partial t} + \frac{\partial w}{\partial z}\frac{\partial z}{\partial t}. \qquad (9)$$

We saw that, in the case of a function of a single variable, the expression for its first differential does not depend on the choice of the independent variable [50]. We show that this property remains valid in the case of a function of several variables.

We take for clarity a function of two variables:

$$z = \varphi(x, y).$$
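We suppose that $x$ and $y$ are functions of the independent variables $u$ and $v$. We have, in accordance with the rule for differentiation of functions of a function:

$$\frac{\partial z}{\partial u} = \frac{\partial z}{\partial x}\frac{\partial x}{\partial u} + \frac{\partial z}{\partial y}\frac{\partial y}{\partial u}; \qquad \frac{\partial z}{\partial v} = \frac{\partial z}{\partial x}\frac{\partial x}{\partial v} + \frac{\partial z}{\partial y}\frac{\partial y}{\partial v}.$$

The total differential of the function is by definition:

$$dz = \frac{\partial z}{\partial u}\,du + \frac{\partial z}{\partial v}\,dv.$$

We obtain on substituting the expressions for the partial derivatives:

$$dz = \frac{\partial z}{\partial x}\left(\frac{\partial x}{\partial u}\,du + \frac{\partial x}{\partial v}\,dv\right) + \frac{\partial z}{\partial y}\left(\frac{\partial y}{\partial u}\,du + \frac{\partial y}{\partial v}\,dv\right).$$

But the expressions in curved brackets are the total differentials of $x$ and $y$, and we can write:

$$dz = \frac{\partial z}{\partial x}\,dx + \frac{\partial z}{\partial y}\,dy,$$

i.e. the differential of a function of a function has the same form as it would have if the variables were independent.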

This property allows the rules for finding the differential of a sum, product and quotient to be extended to the case of functions of several variables:

$$d(u + v) = du + dv, \qquad d(uv) = v\,du + u\,dv, \qquad d\,\frac{u}{v} = \frac{v\,du - u\,dv}{v^2},$$

where $u$ and $v$ are functions of several independent variables. For instance, we can use the property proved to write:

$$d(uv) = \frac{\partial (uv)}{\partial u}\,du + \frac{\partial (uv)}{\partial v}\,dv = v\,du + u\,dv.$$

154. Euler's theorem. A function of any number of variables is called a homogeneous function of degree $m$ of these variables if multiplication of all the variables by an arbitrary magnitude $t$ results in multiplication of the function by $t^m$.

We confine ourselves for the sake of clarity to the case of a function of two variables; we can say that $f(x, y)$ is called a homogeneous function of degree $m$ if it satisfies the identity:

$$f(tx, ty) = t^m f(x, y). \qquad (10)$$

Suppose, for instance, that $f(x, y)$ is the expression for a certain volume, that $x$ and $y$ are the lengths of certain lines, and that, apart from these lengths, only abstract numbers appear in the expression. Multiplication of $x$ and $y$ by $t$ is equivalent to decreasing the scale $t$ times, and it is clear that the number expressing the volume must now be multiplied by $t^3$, i.e. $f(x, y)$ will in this case be a homogeneous function of the third degree.†


† For instance, the volume of a cone is expressed in terms of the radius $r$ of its base and its height $h$ by the formula: $V = \frac{1}{3}\pi r^2 h$.
Differentiation of identity (10) with respect to $t$, using the rule for differentiation of a function of a function for the left-hand side, gives us the identity:

$$x f'_u(u, v) + y f'_v(u, v) = m t^{m-1} f(x, y),$$

where $u = tx$ and $v = ty$. We find on substituting $t = 1$ in this identity:

$$x f'_x(x, y) + y f'_y(x, y) = m f(x, y), \qquad (11)$$

which is the expression for Euler's theorem : 

The sum of the products of the partial derivatives of a homogeneous 
function and the corresponding variables, is equal to the function itself 
multiplied by its degree of homogeneity. 
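
Identity (11) is easy to test numerically on the cone volume mentioned in the footnote, which is homogeneous of degree 3 in $r$ and $h$. The sketch below is an illustration under assumptions made here: the partial derivatives are approximated by central differences, and the names `cone_volume` and `euler_sum` and the step size are arbitrary.

```python
import math


def cone_volume(r, h):
    """Volume of a cone, V = (1/3) * pi * r^2 * h; homogeneous of degree 3."""
    return math.pi * r * r * h / 3.0


def euler_sum(f, x, y, step=1e-6):
    """Approximate x * f'_x + y * f'_y using central differences."""
    fx = (f(x + step, y) - f(x - step, y)) / (2 * step)
    fy = (f(x, y + step) - f(x, y - step)) / (2 * step)
    return x * fx + y * fy


if __name__ == "__main__":
    r, h = 2.0, 5.0
    print(euler_sum(cone_volume, r, h), 3 * cone_volume(r, h))
```

The two printed numbers should agree up to the error of the difference quotients, as (11) requires with $m = 3$.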

If $m = 0$, we get, on putting $t = 1/x$ in (10):

$$f(x, y) = f\left(1, \frac{y}{x}\right),$$

i.e. a homogeneous function of degree zero is a function of the ratios of all the variables to one of them. The sum of the products of the partial derivatives with the corresponding variables must be equal to zero for such a function. A homogeneous function of degree zero is often called simply homogeneous.


155. Partial derivatives of higher orders. The partial derivatives of a function of several variables are in turn functions of these variables, and we can in turn find their partial derivatives. We thus obtain partial derivatives of the second order of the original function, and these are again functions of the same variables; differentiation of them leads to partial derivatives of the third order of the original function, and so on. For instance, in the case of the function of two variables, $u = f(x, y)$, further differentiation of each of the partial derivatives $\partial u/\partial x$ and $\partial u/\partial y$ with respect to $x$ and $y$ leads to four second order derivatives, denoted by:

$$f''_{x^2}(x, y), \quad f''_{xy}(x, y), \quad f''_{yx}(x, y), \quad f''_{y^2}(x, y)$$

or

$$\frac{\partial^2 f(x, y)}{\partial x^2}, \quad \frac{\partial^2 f(x, y)}{\partial x\,\partial y}, \quad \frac{\partial^2 f(x, y)}{\partial y\,\partial x}, \quad \frac{\partial^2 f(x, y)}{\partial y^2},$$

or finally,

$$\frac{\partial^2 u}{\partial x^2}, \quad \frac{\partial^2 u}{\partial x\,\partial y}, \quad \frac{\partial^2 u}{\partial y\,\partial x}, \quad \frac{\partial^2 u}{\partial y^2}.$$

The derivatives $f''_{xy}(x, y)$ and $f''_{yx}(x, y)$ only differ in the order of differentiation. In the first case, differentiation is first with respect to $x$, then with respect to $y$, and in the reverse order in the second case. We show that these two derivatives are identical, i.e. the order of differentiation does not affect the result.
We take the expression:

$$\omega = f(x + h, y + k) - f(x + h, y) - f(x, y + k) + f(x, y).$$

Setting

$$\varphi(x, y) = f(x + h, y) - f(x, y),$$

we can write $\omega$ in the form:

$$\omega = [f(x + h, y + k) - f(x, y + k)] - [f(x + h, y) - f(x, y)] = \varphi(x, y + k) - \varphi(x, y).$$

Applying Lagrange's formula twice [63], we get:

$$\omega = k\,\varphi'_y(x, y + \theta_1 k) = k\,[f'_y(x + h, y + \theta_1 k) - f'_y(x, y + \theta_1 k)] = hk\,f''_{yx}(x + \theta_2 h, y + \theta_1 k).$$

The letter $\theta$ with either subscript denotes a number lying between 0 and 1. We denote by the symbol $f'_y(x + h, y + \theta_1 k)$ the partial derivative of $f(x, y)$ with respect to its second argument $y$ after substitution of $x + h$ and $y + \theta_1 k$ for $x$ and $y$ respectively. An analogous notation is used for the other partial derivatives.
Similarly, putting:

$$\psi(x, y) = f(x, y + k) - f(x, y),$$

we can write:

$$\omega = [f(x + h, y + k) - f(x + h, y)] - [f(x, y + k) - f(x, y)] = \psi(x + h, y) - \psi(x, y) = h\,\psi'_x(x + \theta_3 h, y) =$$
$$= h\,[f'_x(x + \theta_3 h, y + k) - f'_x(x + \theta_3 h, y)] = hk\,f''_{xy}(x + \theta_3 h, y + \theta_4 k).$$

We have on comparing these two expressions for $\omega$:

$$hk\,f''_{yx}(x + \theta_2 h, y + \theta_1 k) = hk\,f''_{xy}(x + \theta_3 h, y + \theta_4 k)$$

or

$$f''_{yx}(x + \theta_2 h, y + \theta_1 k) = f''_{xy}(x + \theta_3 h, y + \theta_4 k).$$
Letting $h$ and $k$ tend to zero, we obtain on the assumption that the second order derivatives written are continuous:

$$f''_{yx}(x, y) = f''_{xy}(x, y).$$

This discussion leads to the following theorem: 

THEOREM. If $f(x, y)$ has continuous derivatives $f''_{yx}(x, y)$ and $f''_{xy}(x, y)$ inside a certain domain, these derivatives are equal at all interior points of the domain.
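
The second difference used in the proof also gives a simple numerical check of the theorem. In the sketch below (an illustration only; the sample function, the point and the step sizes are assumptions made here) the quotient of the second difference by $hk$ is compared with the mixed derivative computed by hand for $f(x, y) = x^3 y^2 + \sin(xy)$.

```python
import math


def f(x, y):
    """Sample smooth function (an assumption of this example)."""
    return x**3 * y**2 + math.sin(x * y)


def omega(func, x, y, h, k):
    """Second difference f(x+h, y+k) - f(x+h, y) - f(x, y+k) + f(x, y)."""
    return func(x + h, y + k) - func(x + h, y) - func(x, y + k) + func(x, y)


def mixed_partial_exact(x, y):
    """Mixed second derivative of the sample function, computed by hand:
    6 x^2 y + cos(xy) - x y sin(xy); the same in either order."""
    return 6 * x**2 * y + math.cos(x * y) - x * y * math.sin(x * y)


if __name__ == "__main__":
    x, y, h, k = 1.0, 2.0, 1e-4, 1e-4
    print(omega(f, x, y, h, k) / (h * k), mixed_partial_exact(x, y))
```

Since the second difference is symmetric in the roles of $x$ and $y$, its quotient by $hk$ approximates both mixed derivatives at once, which is the idea behind the proof above.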

We now consider two third order derivatives:

$$f'''_{x^2 y}(x, y) \quad \text{and} \quad f'''_{y x^2}(x, y),$$

differing only in the order of differentiation. Noting that twice repeated differentiation is independent of the order of differentiation, in accordance with the result proved, we can write:

$$f'''_{x^2 y}(x, y) = \frac{\partial^2 f'_x(x, y)}{\partial x\,\partial y} = \frac{\partial^2 f'_x(x, y)}{\partial y\,\partial x} = f'''_{xyx}(x, y) =$$
$$= [f''_{yx}(x, y)]'_x = f'''_{y x^2}(x, y),$$

i.e. in this case also, the result is not affected by the order of differentiation. This property can easily be generalized for derivatives of any order, and for functions of any number of variables, and we can state the general theorem: the result of differentiation does not depend on the order in which the differentiations are carried out.

We remark that use was made in the proof, not only of the existence 
of the derivatives, but also of their continuity inside a certain domain. 

In future, we shall always assume the continuity of the derivatives 
that we discuss, and by the theorem just stated, it will only be necessary 
to indicate, for derivatives of higher orders, the order n of the deri- 
vative, the variables with respect to which differentiation is carried 
out, and the number of differentiations. 

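For instance, in the case of $w = f(x, y, z, t)$, the following notation is used:

$$\frac{\partial^n f(x, y, z, t)}{\partial x^\alpha\,\partial y^\beta\,\partial z^\gamma\,\partial t^\delta} \quad \text{or} \quad \frac{\partial^n w}{\partial x^\alpha\,\partial y^\beta\,\partial z^\gamma\,\partial t^\delta} \qquad (\alpha + \beta + \gamma + \delta = n),$$

indicating that the $n$th order derivative is taken, with differentiation carried out $\alpha$ times with respect to $x$, $\beta$ times with respect to $y$, $\gamma$ times with respect to $z$, and $\delta$ times with respect to $t$.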
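156. Differentials of higher orders. The total differential $du$ of a function of several variables is in turn a function of these variables, and we can define the total differential of this second function. We thus get the second order differential $d^2u$ of the original function $u$, and again, this will be a function of the same variables and its total differential will bring us to the third order differential $d^3u$ of the original function, and so on.

We consider in detail the function of two variables $u = f(x, y)$, where $x$ and $y$ will be assumed to be independent variables. By definition,

$$du = \frac{\partial f(x, y)}{\partial x}\,dx + \frac{\partial f(x, y)}{\partial y}\,dy. \qquad (12)$$

When obtaining $d^2u$, we take into account the fact that the differentials $dx$ and $dy$ of the independent variables are to be regarded as constants, so that they can be taken outside the differentiation sign:

$$d^2u = d\left[\frac{\partial f(x, y)}{\partial x}\,dx\right] + d\left[\frac{\partial f(x, y)}{\partial y}\,dy\right] = dx \cdot d\,\frac{\partial f(x, y)}{\partial x} + dy \cdot d\,\frac{\partial f(x, y)}{\partial y} =$$
$$= dx\left[\frac{\partial^2 f(x, y)}{\partial x^2}\,dx + \frac{\partial^2 f(x, y)}{\partial x\,\partial y}\,dy\right] + dy\left[\frac{\partial^2 f(x, y)}{\partial y\,\partial x}\,dx + \frac{\partial^2 f(x, y)}{\partial y^2}\,dy\right] =$$
$$= \frac{\partial^2 f(x, y)}{\partial x^2}\,dx^2 + 2\,\frac{\partial^2 f(x, y)}{\partial x\,\partial y}\,dx\,dy + \frac{\partial^2 f(x, y)}{\partial y^2}\,dy^2.$$

We obtain $d^3u$ in exactly the same way:

$$d^3u = \frac{\partial^3 f(x, y)}{\partial x^3}\,dx^3 + 3\,\frac{\partial^3 f(x, y)}{\partial x^2\,\partial y}\,dx^2\,dy + 3\,\frac{\partial^3 f(x, y)}{\partial x\,\partial y^2}\,dx\,dy^2 + \frac{\partial^3 f(x, y)}{\partial y^3}\,dy^3.$$

These expressions for $d^2u$ and $d^3u$ lead us to the following symbolic formula for the differential of any order:

$$d^n u = \left(\frac{\partial}{\partial x}\,dx + \frac{\partial}{\partial y}\,dy\right)^n f. \qquad (13)$$

The formula implies that the sum in brackets is to be worked out by Newton's binomial formula for degree $n$, then the various exponents of the powers of $\partial/\partial x$ and $\partial/\partial y$ are to be taken as indicating the orders of the derivatives of the function $f$ with respect to $x$ and $y$.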
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We have seen that (13) is justified for n equal to 1, 2, and 3. The full 
proof requires the use of the usual method, of assuming that the for- 
mula is true for n and proving that it is then true for (n + 1). Thus 
we find the differential of order (n + 1): 





u | O(d"u) 9(d'u) , (90 a 
dy = d (du) = PO de + S dy = (47 du + dy) d'u, 
where the symbol of the general form 


9 9 
(iz dz + du dy) 9 
denotes 
9p Op 
Be I2 + Gy W- 


Noting that formula (13) for d'u is assumed proved, we can 
write: 
n, (2 ə ə Bons B 


= (oz +g u) t, 


i.e. the formula is proved for d"*!w. 
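
As a numerical illustration of symbolic formula (13), the following minimal Python sketch expands the binomial for the monomial u = x^p y^q, whose partial derivatives are known in closed form. The function names `partial_monomial`, `differential` and `phi`, and the sample values, are assumptions made for this illustration and do not come from the text. The check compares $d^2u$ with the second derivative of u along the straight line (x + t dx, y + t dy), taking dt = 1.

```python
from math import comb


def partial_monomial(p, q, i, j, x, y):
    """Partial derivative of x**p * y**q, taken i times in x and j times in y."""
    if i > p or j > q:
        return 0.0
    coeff = 1
    for m in range(i):
        coeff *= p - m          # falling factorial p(p-1)...(p-i+1)
    for m in range(j):
        coeff *= q - m          # falling factorial q(q-1)...(q-j+1)
    return coeff * x ** (p - i) * y ** (q - j)


def differential(n, p, q, x, y, dx, dy):
    """d^n u for u = x**p * y**q, expanded by the binomial formula as in (13)."""
    return sum(
        comb(n, k) * partial_monomial(p, q, n - k, k, x, y) * dx ** (n - k) * dy ** k
        for k in range(n + 1)
    )


def phi(t, p, q, x, y, dx, dy):
    """Value of u along the straight line (x + t*dx, y + t*dy)."""
    return (x + t * dx) ** p * (y + t * dy) ** q


# Second differential of u = x^3 y^2 at (1, 2) with dx = 0.1, dy = -0.2
d2u = differential(2, 3, 2, 1.0, 2.0, 0.1, -0.2)

# Independent check: second derivative of phi at t = 0 by a central difference
h = 1e-3
args = (3, 2, 1.0, 2.0, 0.1, -0.2)
d2u_numeric = (phi(h, *args) - 2 * phi(0.0, *args) + phi(-h, *args)) / h ** 2
```

For these values the hand computation gives 24·dx² + 2·12·dx·dy + 2·dy² = −0.16, and `d2u_numeric` should agree with `d2u` up to the discretisation error of the difference quotient.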

Formula (13) is easily generalized for the case of a function of any
number of independent variables. As we know [153], for n = 1 formula (13)
is true not merely when x and y are independent variables. When deducing
the expression for $d^2u$, however, it was essential to assume that dx
and dy are constants, and for n > 1 (13) is only true when dx and dy can
be regarded as constants.

This is the case when x and y are in fact independent variables. 
We now suppose that x and y are linear functions of the independent 
variables z and t: 

$$x = az + bt + c, \qquad y = a_1 z + b_1 t + c_1,$$

where the coefficients and the absolute terms are constants. We
obtain for dx and dy:

$$dx = a\,dz + b\,dt, \qquad dy = a_1\,dz + b_1\,dt.$$

But dz and dt are differentials of independent variables, and must
be considered as constants; the same can therefore be said in this case
as regards dx and dy. We can thus assert that symbolic formula (13)
is valid both when x and y are independent variables, and when they are
linear functions (integral polynomials of the first order) of the
independent variables.




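If dx and dy cannot be regarded as constants, (13) is no longer true.
We find the expression for $d^2u$ in this general case. We are no longer
justified in taking dx and dy outside the differentiation sign, as was
done above, when finding

$$d\left(\frac{\partial f(x, y)}{\partial x}\,dx\right) \quad\text{and}\quad d\left(\frac{\partial f(x, y)}{\partial y}\,dy\right),$$

and the formula for differentiating a product has to be used [153].
We thus get:

$$d^2u = dx \cdot d\,\frac{\partial f(x, y)}{\partial x} + dy \cdot d\,\frac{\partial f(x, y)}{\partial y} + \frac{\partial f(x, y)}{\partial x}\,d^2x + \frac{\partial f(x, y)}{\partial y}\,d^2y.$$

The sum of the first two terms on the right-hand side of this equation
gives us the expression that we had above for $d^2u$, and we finally
get:

$$d^2u = \frac{\partial^2 f(x, y)}{\partial x^2}\,dx^2 + 2\,\frac{\partial^2 f(x, y)}{\partial x\,\partial y}\,dx\,dy + \frac{\partial^2 f(x, y)}{\partial y^2}\,dy^2 + \frac{\partial f(x, y)}{\partial x}\,d^2x + \frac{\partial f(x, y)}{\partial y}\,d^2y, \tag{14}$$

i.e. in the general case, the expression for $d^2u$ contains additional
terms, depending on $d^2x$ and $d^2y$.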
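
A short worked example, which is not part of the original text and uses functions chosen purely for illustration, shows why the additional terms cannot be dropped. Take u = xy with x = t², y = t, where t is the independent variable, so that u = t³ and direct differentiation gives $d^2u = 6t\,dt^2$.

```latex
% Illustrative check of (14) for u = xy, x = t^2, y = t (t independent)
\begin{gather*}
dx = 2t\,dt, \qquad d^2x = 2\,dt^2, \qquad dy = dt, \qquad d^2y = 0, \\
\frac{\partial^2 f}{\partial x^2}\,dx^2
  + 2\,\frac{\partial^2 f}{\partial x\,\partial y}\,dx\,dy
  + \frac{\partial^2 f}{\partial y^2}\,dy^2
  = 0 + 2 \cdot 1 \cdot (2t\,dt)(dt) + 0 = 4t\,dt^2, \\
\frac{\partial f}{\partial x}\,d^2x + \frac{\partial f}{\partial y}\,d^2y
  = y \cdot 2\,dt^2 + x \cdot 0 = 2t\,dt^2, \\
d^2u = 4t\,dt^2 + 2t\,dt^2 = 6t\,dt^2 .
\end{gather*}
```

The first three terms alone would give only $4t\,dt^2$; the terms in $d^2x$ and $d^2y$ supply the missing $2t\,dt^2$.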


157. Implicit functions. We now indicate a rule for differentiating 
functions that are given implicitly. We shall assume here, that the 
equations written down in fact define a certain function, having the 
corresponding derivatives. We prove this for certain conditions in [159]. 
If y is an implicit function of x:

$$F(x, y) = 0, \tag{15}$$

the first derivative y′ of this function is defined, as we know, by the
equation [69]:

$$F'_x(x, y) + F'_y(x, y)\,y' = 0. \tag{16}$$

We derived (16) by assuming that y is a function of x in (15) and
differentiating both sides of (15) with respect to x. By dealing with (16)
in the same way, we get an equation for the second derivative y″:

$$F''_{x^2}(x, y) + 2F''_{xy}(x, y)\,y' + F''_{y^2}(x, y)\,y'^2 + F'_y(x, y)\,y'' = 0. \tag{17}$$

On differentiating this equation with respect to x, we get an
equation for the third derivative y‴, and so on.

We note the fact that, in the equations thus obtained, the
coefficients of the required derivatives of the implicit function are all
the same, viz., $F'_y(x, y)$, and hence, if this coefficient differs from
zero for certain values of x and y satisfying (15), the above method will
give, for these values, completely defined values for the derivatives of
any order of the implicit function. The existence of the partial derivatives
on the left-hand side of (15) is assumed here, of course.
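
The way (16) and (17) are solved in turn for y′ and y″ can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `implicit_derivatives` and the example of the circle x² + y² = 1 are not taken from the text.

```python
def implicit_derivatives(Fx, Fy, Fxx, Fxy, Fyy):
    """Solve (16) for y' and then (17) for y'', given the partial derivatives of F at a point."""
    if Fy == 0:
        raise ValueError("F'_y vanishes: (16) and (17) do not determine y' and y''")
    y1 = -Fx / Fy
    y2 = -(Fxx + 2 * Fxy * y1 + Fyy * y1 ** 2) / Fy
    return y1, y2


# Hypothetical example: the circle F(x, y) = x^2 + y^2 - 1 = 0 at the point (0.6, 0.8)
x, y = 0.6, 0.8
y1, y2 = implicit_derivatives(Fx=2 * x, Fy=2 * y, Fxx=2.0, Fxy=0.0, Fyy=2.0)
```

On the upper half of the circle y = √(1 − x²), so that y′ = −x/y and y″ = −1/y³, which is what the two returned values reproduce at the chosen point.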

We take the equation with three variables: 


$$\Phi(x, y, z) = 0.$$

This type of equation defines z as an implicit function of the
independent variables x and y, and if z were to be replaced on the
left-hand side of this equation by the function of x and y that it represents,
the left-hand side would become identically equal to zero. Thus,
assuming that z is a function of the independent variables x and y, we must
get zero on differentiating the left-hand side of the equation with
respect to x, and with respect to y:

$$\Phi'_x(x, y, z) + \Phi'_z(x, y, z)\,z'_x = 0, \qquad \Phi'_y(x, y, z) + \Phi'_z(x, y, z)\,z'_y = 0.$$

The partial derivatives of the first order $z'_x$ and $z'_y$ are found from
these equations. If we differentiate the first of the relationships written
a second time with respect to x, we obtain an equation for the partial
derivative $z''_{x^2}$; and so on. The coefficient of the required derivative
will be $\Phi'_z(x, y, z)$ in all the equations obtained. We now consider the
system of equations:

$$\varphi(x, y, z) = 0, \qquad \psi(x, y, z) = 0.$$

We shall assume that this system defines y and z as implicit
functions of x. Differentiation of both equations of the system with respect
to x, on the assumption that y and z are functions of x, gives us a
system of equations of the first degree, defining the derivatives y′
and z′ of y and z with respect to x:

$$\begin{aligned}
\varphi'_x(x, y, z) + \varphi'_y(x, y, z)\,y' + \varphi'_z(x, y, z)\,z' &= 0, \\
\psi'_x(x, y, z) + \psi'_y(x, y, z)\,y' + \psi'_z(x, y, z)\,z' &= 0.
\end{aligned}$$


Differentiation of these relationships once more with respect to x
gives us a system of equations defining the second derivatives y″
and z″. Further differentiation with respect to x gives a system of
equations defining y‴ and z‴, and so on.

The nth order derivatives $y^{(n)}$ and $z^{(n)}$ will be defined here by a
system of the form:

$$\begin{aligned}
\varphi'_y(x, y, z)\,y^{(n)} + \varphi'_z(x, y, z)\,z^{(n)} + A &= 0, \\
\psi'_y(x, y, z)\,y^{(n)} + \psi'_z(x, y, z)\,z^{(n)} + B &= 0,
\end{aligned} \tag{$17_1$}$$

where A and B are expressions containing derivatives of orders lower
than n. By elementary algebra, such a system will give a unique
solution if the condition is satisfied:

$$\varphi'_y(x, y, z)\,\psi'_z(x, y, z) - \varphi'_z(x, y, z)\,\psi'_y(x, y, z) \neq 0.$$

The method described above leads to fully defined values of the
derivatives for all x, y, z for which this condition is fulfilled, and which
satisfy system ($17_1$).

If a system of m equations in (m + n) variables is given, the system 
generally speaking defines m variables as implicit functions of the 
remaining n variables, and the derivatives of these implicit functions 
can be obtained by the above method as a result of differentiation of 
the equations with respect to the independent variables. 
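
The first-degree system for y′ and z′ described above can be solved by elementary algebra (Cramer's rule), as in the following minimal sketch. The function name `derivatives_from_system` and the example, the circle cut from the unit sphere by the plane x + y + z = 0, are assumptions introduced for illustration only.

```python
from math import sqrt


def derivatives_from_system(phi_x, phi_y, phi_z, psi_x, psi_y, psi_z):
    """Solve phi_x + phi_y*y' + phi_z*z' = 0 and psi_x + psi_y*y' + psi_z*z' = 0 for (y', z')."""
    det = phi_y * psi_z - phi_z * psi_y
    if det == 0:
        raise ValueError("determinant vanishes: the system does not determine y' and z'")
    y1 = (-phi_x * psi_z + phi_z * psi_x) / det
    z1 = (-phi_y * psi_x + psi_y * phi_x) / det
    return y1, z1


# Hypothetical example: phi = x + y + z, psi = x^2 + y^2 + z^2 - 1
# at the point (1/sqrt(2), -1/sqrt(2), 0), which satisfies both equations.
x, y, z = 1 / sqrt(2), -1 / sqrt(2), 0.0
y1, z1 = derivatives_from_system(1.0, 1.0, 1.0, 2 * x, 2 * y, 2 * z)
```

At this point the determinant equals √2, and the sketch yields y′ = 1, z′ = −2, which satisfy both differentiated equations: 1 + y′ + z′ = 0 and 2x + 2y y′ + 2z z′ = 0.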


158. Example. We take as an example the equation

$$ax^2 + by^2 + cz^2 = 1, \tag{18}$$

which defines z as a function of x and y. We get on differentiating with
respect to x:

$$ax + cz\,z'_x = 0, \tag{19}$$

and similarly, differentiating with respect to y:

$$by + cz\,z'_y = 0, \tag{$19_1$}$$

whence

$$z'_x = -\frac{ax}{cz}, \qquad z'_y = -\frac{by}{cz}.$$

We obtain on differentiating (19) with respect to x, and with respect
to y, and ($19_1$) with respect to y:

$$a + c\,z'^2_x + cz\,z''_{x^2} = 0, \qquad c\,z'_x z'_y + cz\,z''_{xy} = 0, \qquad b + c\,z'^2_y + cz\,z''_{y^2} = 0,$$

whence:

$$\begin{aligned}
z''_{x^2} &= -\frac{a + c\,z'^2_x}{cz} = -\frac{a + \dfrac{a^2x^2}{cz^2}}{cz} = -\frac{acz^2 + a^2x^2}{c^2z^3}, \\
z''_{xy} &= -\frac{z'_x z'_y}{z} = -\frac{abxy}{c^2z^3}, \\
z''_{y^2} &= -\frac{b + c\,z'^2_y}{cz} = -\frac{bcz^2 + b^2y^2}{c^2z^3}.
\end{aligned}$$

We now give a second method of finding the partial derivatives, using 
the expression for the total differential of a function. We first prove an auxiliary 
theorem. Suppose we are asked to find an expression for the total differential 
dz of a function of two independent variables x and y in the form: 


$$dz = p\,dx + q\,dy.$$

We already know that

$$dz = z'_x\,dx + z'_y\,dy.$$

We find, on comparing these two equations:

$$p\,dx + q\,dy = z'_x\,dx + z'_y\,dy.$$

But dx and dy are arbitrary constants, being the differentials of the
independent variables. We put dx = 1 and dy = 0, or dx = 0 and dy = 1, and find
$p = z'_x$ and $q = z'_y$.

Thus, if the total differential of a function z of two independent variables
x and y can be put in the form:

$$dz = p\,dx + q\,dy,$$

we have $p = z'_x$ and $q = z'_y$.
This theorem is valid for a function of any number of independent variables.
It can be shown similarly that, if the second order differential can be put in
the form:

$$d^2z = r\,dx^2 + 2s\,dx\,dy + t\,dy^2,$$

we have $r = z''_{x^2}$, $s = z''_{xy}$ and $t = z''_{y^2}$.

We now return to our example. Instead of finding the derivatives of the 
left-hand side of (18) with respect to x and y, we find its differential, whilst 
recalling that the expression for the first differential is independent of the 
choice of independent variables [153]: 


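$$ax\,dx + by\,dy + cz\,dz = 0, \tag{20}$$

whence

$$dz = -\frac{ax}{cz}\,dx - \frac{by}{cz}\,dy,$$

and therefore, by the theorem just proved:

$$z'_x = -\frac{ax}{cz}, \qquad z'_y = -\frac{by}{cz}.$$

We now find the differential of the left-hand side of (20), remembering
that dx and dy have to be regarded as constants here:

$$a\,dx^2 + b\,dy^2 + c\,dz^2 + cz\,d^2z = 0$$

or

$$\begin{aligned}
d^2z &= -\frac{a}{cz}\,dx^2 - \frac{b}{cz}\,dy^2 - \frac{1}{z}\,dz^2
      = -\frac{a}{cz}\,dx^2 - \frac{b}{cz}\,dy^2 - \frac{1}{z}\left(\frac{ax}{cz}\,dx + \frac{by}{cz}\,dy\right)^2 \\
     &= -\frac{acz^2 + a^2x^2}{c^2z^3}\,dx^2 - 2\,\frac{abxy}{c^2z^3}\,dx\,dy - \frac{bcz^2 + b^2y^2}{c^2z^3}\,dy^2,
\end{aligned}$$

and therefore:

$$z''_{x^2} = -\frac{acz^2 + a^2x^2}{c^2z^3}, \qquad z''_{xy} = -\frac{abxy}{c^2z^3}, \qquad z''_{y^2} = -\frac{bcz^2 + b^2y^2}{c^2z^3}.$$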


Thus, having found the differential of a certain order, we can obtain all 
the partial derivatives of the corresponding order. 
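
The second derivatives found above can be checked numerically, since (18) can also be solved explicitly for z. The sketch below compares the formulas with central difference quotients. The coefficient values, the sample point and the function names are assumptions chosen for illustration.

```python
from math import sqrt

a, b, c = 1.0, 2.0, 3.0          # hypothetical coefficients in (18)


def z(x, y):
    """Upper branch of a*x^2 + b*y^2 + c*z^2 = 1, solved explicitly for z."""
    return sqrt((1.0 - a * x * x - b * y * y) / c)


def second_derivatives(x, y):
    """z''_{x^2}, z''_{xy}, z''_{y^2} from the formulas obtained in the example."""
    zz = z(x, y)
    denom = c * c * zz ** 3
    z_xx = -(a * c * zz * zz + a * a * x * x) / denom
    z_xy = -(a * b * x * y) / denom
    z_yy = -(b * c * zz * zz + b * b * y * y) / denom
    return z_xx, z_xy, z_yy


def second_differences(x, y, h=1e-4):
    """The same three derivatives estimated by central differences."""
    z_xx = (z(x + h, y) - 2 * z(x, y) + z(x - h, y)) / h ** 2
    z_yy = (z(x, y + h) - 2 * z(x, y) + z(x, y - h)) / h ** 2
    z_xy = (z(x + h, y + h) - z(x + h, y - h)
            - z(x - h, y + h) + z(x - h, y - h)) / (4 * h ** 2)
    return z_xx, z_xy, z_yy


exact = second_derivatives(0.3, 0.2)
approx = second_differences(0.3, 0.2)
```

The two triples should agree to within the accuracy of the difference quotients, which is limited by the step h and by rounding.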


159. Existence of implicit functions. Our discussions have been of a formal 
kind, since we have presupposed in every case that the equation or system 
of equations implicitly defines a certain function that possesses derivatives. 
We now prove the basic existence theorem for implicit functions. 

We take the equation 

F(x, y) = 0 (21) 


and indicate under what conditions it uniquely defines y as a continuous 
function of x that possesses derivatives. 
THEOREM. Let $x = x_0$ and $y = y_0$ be solutions of equation (21), i.e.

$$F(x_0, y_0) = 0; \tag{22}$$

let F(x, y) and its partial derivatives of the first order with respect to x and y
be continuous for all x, y sufficiently close to $x_0$, $y_0$, and finally, let the partial
derivative $F'_y(x, y)$ differ from zero for $x = x_0$, $y = y_0$. In these circumstances,
there exists a uniquely defined function y(x) for all x sufficiently close to $x_0$,
which satisfies equation (21), is continuous, possesses a derivative and satisfies
the condition: $y(x_0) = y_0$.

We take for clarity $F'_y(x, y) > 0$ at $x = x_0$, $y = y_0$. Since this derivative
is continuous by hypothesis, it will also be positive for all x, y sufficiently
near $x_0$, $y_0$, i.e. there exists a positive number l such that F(x, y) and its
partial derivatives are continuous, and

$$F'_y(x, y) > 0, \tag{23}$$

for all x, y satisfying the condition:

$$|x - x_0| \le l, \qquad |y - y_0| \le l. \tag{24}$$

Moreover, by (22), the function $F(x_0, y)$ of the single variable y vanishes for
$y = y_0$, whilst by (23) and (24), it is an increasing function of y in the interval
$(y_0 - l, y_0 + l)$. The numbers $F(x_0, y_0 - l)$ and $F(x_0, y_0 + l)$ thus have
different signs: the former is negative, the latter positive. We can assert,
on taking into account the continuity of F(x, y) [67], that $F(x, y_0 - l)$
will be negative, and $F(x, y_0 + l)$ positive, for all x sufficiently near $x_0$,
i.e. there exists a positive number $l_1$, such that

$$F(x, y_0 - l) < 0, \qquad F(x, y_0 + l) > 0 \tag{25}$$

for $|x - x_0| \le l_1$. Let m denote the smaller of l, $l_1$. We can say, from (24)
and (25), that inequalities (23) and (25) are satisfied if x and y satisfy the
inequalities:

$$|x - x_0| \le m, \qquad |y - y_0| \le l. \tag{26}$$

If we take any definite x, lying in the interval $(x_0 - m, x_0 + m)$, i.e.
satisfying the first of inequalities (26), we can see by (23) that F(x, y),
considered as a function of y, will be increasing in the interval $(y_0 - l, y_0 + l)$,
and by (25), it will have different signs at the ends of this interval. It will
therefore vanish for a uniquely defined value of y in the interval. In
particular, if $x = x_0$, this value of y will be $y = y_0$, by (22). We have thus proved
the existence of a definite function y(x) in the interval $(x_0 - m, x_0 + m)$,
representing a solution of equation (21) and satisfying the condition
$y(x_0) = y_0$. In other words, it follows from the foregoing arguments that equation
(21) has a unique root, inside the interval $(y_0 - l, y_0 + l)$, for every fixed
x from the interval $(x_0 - m, x_0 + m)$.

We now show that the function obtained, y(x), is continuous at $x = x_0$.
In fact, given any small positive ε, the numbers $F(x_0, y_0 - \varepsilon)$ and
$F(x_0, y_0 + \varepsilon)$ will have different signs, by (25): so that a positive η will exist,
such that $F(x, y_0 - \varepsilon)$ and $F(x, y_0 + \varepsilon)$ have different signs, provided only
that $|x - x_0| < \eta$. Thus, given $|x - x_0| < \eta$, the root of equation (21),
i.e. the value of our function y(x), satisfies $|y - y_0| < \varepsilon$, which proves
the continuity of y(x) at $x = x_0$.

We now prove the existence of the derivative y′(x) at $x = x_0$. Let
$x - x_0 = \Delta x$, and let the corresponding increment of y be $\Delta y = y - y_0$.
Hence, $x = x_0 + \Delta x$ and $y = y_0 + \Delta y$ satisfy (21), i.e. we have
$F(x_0 + \Delta x, y_0 + \Delta y) = 0$, and we can write, by (22):

$$F(x_0 + \Delta x, y_0 + \Delta y) - F(x_0, y_0) = 0.$$

We can use the continuity of the partial derivatives to rewrite this equation
as [68]:

$$[F'_{x_0}(x_0, y_0) + \varepsilon_1]\,\Delta x + [F'_{y_0}(x_0, y_0) + \varepsilon_2]\,\Delta y = 0, \tag{27}$$

where $\varepsilon_1$ and $\varepsilon_2 \to 0$ if $\Delta x$ and $\Delta y \to 0$, and where $F'_{x_0}(x_0, y_0)$ and $F'_{y_0}(x_0, y_0)$
denote the values of the partial derivatives at $x = x_0$, $y = y_0$. It follows
from the continuity proved above that $\Delta y \to 0$ if $\Delta x \to 0$.

Equation (27) gives us:

$$\frac{\Delta y}{\Delta x} = -\frac{F'_{x_0}(x_0, y_0) + \varepsilon_1}{F'_{y_0}(x_0, y_0) + \varepsilon_2};$$

we pass to the limit as $\Delta x \to 0$, and get:

$$y'(x_0) = -\frac{F'_{x_0}(x_0, y_0)}{F'_{y_0}(x_0, y_0)}.$$

We have proved the continuity of y(x), and the existence of its derivative,
only for $x = x_0$. On taking any other value of x from the interval
$(x_0 - m, x_0 + m)$, and the corresponding value of y from the interval
$(y_0 - l, y_0 + l)$, representing the root of (21), all the conditions of our theorem
are again satisfied for this pair of values of x and y, and by what has been
proved, y(x) will be continuous and will have a derivative for the value of
x concerned.
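
The proof is constructive in spirit: for each fixed x, the function F(x, y) increases in y and changes sign on the interval $(y_0 - l, y_0 + l)$, so its unique root can be located by repeated halving. The sketch below is an illustration under stated assumptions, not a method given in the text; the name `implicit_root` and the example F(x, y) = y⁵ + y − x are hypothetical. It also compares a difference quotient of the resulting y(x) with the value $-F'_x/F'_y$ obtained above.

```python
def implicit_root(F, x, y_lo, y_hi, iterations=60):
    """Root y of F(x, y) = 0 in [y_lo, y_hi], assuming F increases in y and changes sign there."""
    if not (F(x, y_lo) < 0.0 < F(x, y_hi)):
        raise ValueError("F(x, y_lo) < 0 < F(x, y_hi) is required, as in (25)")
    for _ in range(iterations):
        mid = 0.5 * (y_lo + y_hi)
        if F(x, mid) < 0.0:
            y_lo = mid           # the root lies above mid
        else:
            y_hi = mid           # the root lies at or below mid
    return 0.5 * (y_lo + y_hi)


# Hypothetical example: F(x, y) = y^5 + y - x, for which F'_y = 5y^4 + 1 > 0 everywhere
def F(x, y):
    return y ** 5 + y - x


x0, y0, l = 2.0, 1.0, 1.0                   # F(x0, y0) = 0
y_at_x0 = implicit_root(F, x0, y0 - l, y0 + l)

h = 1e-4
slope = (implicit_root(F, x0 + h, y0 - l, y0 + l)
         - implicit_root(F, x0 - h, y0 - l, y0 + l)) / (2 * h)
predicted = 1.0 / (5 * y0 ** 4 + 1)         # -F'_x / F'_y, with F'_x = -1
```

Halving is chosen here only because it mirrors the sign argument of the proof; any root-finding method that respects the sign change would serve equally well.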

We can state and prove, in exactly the same way, a theorem regarding 
the existence of the implicit function z(x, y), defined by the equation:

$$\Phi(x, y, z) = 0.$$

We now consider the system:

$$\varphi(x, y, z) = 0, \qquad \psi(x, y, z) = 0, \tag{28}$$

defining y and z as functions of x.

The following theorem holds in this case:

THEOREM. Let $x = x_0$, $y = y_0$, $z = z_0$ be solutions of system (28); let
φ(x, y, z), ψ(x, y, z) and their first order partial derivatives be continuous
functions of (x, y, z) for all values of these variables sufficiently near $(x_0, y_0, z_0)$;
and let the expression

$$\varphi'_y(x, y, z)\,\psi'_z(x, y, z) - \varphi'_z(x, y, z)\,\psi'_y(x, y, z)$$

differ from zero for $x = x_0$, $y = y_0$, $z = z_0$. In these circumstances, there
exists for all x sufficiently near $x_0$ a uniquely defined system of two functions
y(x), z(x), that satisfies equations (28); these functions are continuous, have
first order derivatives, and satisfy the conditions: $y(x_0) = y_0$, $z(x_0) = z_0$.

We shall not dwell on the proof of this theorem. The general case of any 
number of functions with any number of variables is considered in the 
third volume. 


160. Curves in space and surfaces. We know from analytic geometry
that, generally speaking, for every equation with three variables

$$F(x, y, z) = 0, \tag{29}$$

or explicitly,

$$z = f(x, y), \tag{30}$$

there is a corresponding surface in space, relative to rectangular axes
OX, OY, OZ.
A line in space can be considered as the intersection of two surfaces,
and can therefore be defined by a combination of two equations:

$$F_1(x, y, z) = 0, \qquad F_2(x, y, z) = 0. \tag{31}$$

Alternatively, a curve can be defined by parametric equations:

$$x = \varphi(t), \qquad y = \psi(t), \qquad z = \omega(t). \tag{32}$$

The length of arc of a curve is defined, as in the case of a plane curve,
as the limit of the perimeter of a series of joined chords inscribed in
the arc, on indefinite decrease of the length of each chord. We omit
the proof, since it follows exactly the same lines as in [103] for the
case of a plane curve, that the length of arc is expressed by the definite
integral:

$$s = \int_{(M_1)}^{(M_2)} \sqrt{(dx)^2 + (dy)^2 + (dz)^2} = \int_{t_1}^{t_2} \sqrt{\varphi'^2(t) + \psi'^2(t) + \omega'^2(t)}\;dt, \tag{33}$$

where $t_1$ and $t_2$ are the values of the parameter t corresponding to the
ends $M_1$ and $M_2$ of the arc, and the differential of the arc has the
expression:

$$ds = \sqrt{(dx)^2 + (dy)^2 + (dz)^2}. \tag{34}$$
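
As an illustration of (33), the integral can be approximated numerically once the derivatives of the parametric functions are known. The sketch below uses the composite Simpson rule, which is a choice made here and not a method from the text; the function name `arc_length` and the helix example are likewise assumptions.

```python
from math import cos, sin, sqrt, pi


def arc_length(dphi, dpsi, domega, t1, t2, n=1000):
    """Approximate integral (33) by the composite Simpson rule with n (even) subintervals."""
    def speed(t):
        return sqrt(dphi(t) ** 2 + dpsi(t) ** 2 + domega(t) ** 2)

    step = (t2 - t1) / n
    total = speed(t1) + speed(t2)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * speed(t1 + i * step)
    return total * step / 3


# Hypothetical example: one turn of the helix x = cos t, y = sin t, z = t
length = arc_length(lambda t: -sin(t), lambda t: cos(t), lambda t: 1.0, 0.0, 2 * pi)
```

For the helix the integrand in (33) is the constant √2, so the length of one turn is 2π√2, which the sketch reproduces up to rounding.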


If the role of parameter t is played by the length of arc s, measured
from some given point of the arc, it can be shown, by an exactly
similar method to that used in the case of a plane curve [70], that the
derivatives dx/ds, dy/ds, dz/ds are equal to the direction cosines of the
tangent to the curve, i.e. are equal to the cosines of the angles formed
by the positive direction of the tangent with the coordinate axes.
Thus, the direction cosines of the tangent to the curve at the point (x, y, z)
of the curve, i.e. the cosines of the angles formed by the direction of the
tangent with the coordinate axes, are proportional to dx, dy and dz,
and the equation of the tangent can be written in the form:

$$\frac{X - x}{dx} = \frac{Y - y}{dy} = \frac{Z - z}{dz}, \tag{35}$$

or

$$\frac{X - \varphi(t)}{\varphi'(t)} = \frac{Y - \psi(t)}{\psi'(t)} = \frac{Z - \omega(t)}{\omega'(t)}. \tag{36}$$











We now introduce a new concept, that of the tangent plane to the
surface:

$$F(x, y, z) = 0. \tag{37}$$


Let M(x, y, z) be a point of the surface, and let L be a line on
the surface passing through the point M. The coordinates of points
of the line are functions of some parameter t, whilst the functions
satisfy equation (37), since the line L lies on the surface. Equation
(37) must be satisfied along the whole of line L, i.e. for all values of t,
and we can write here, on differentiating the left-hand side of (37):

$$F'_x(x, y, z)\,dx + F'_y(x, y, z)\,dy + F'_z(x, y, z)\,dz = 0. \tag{38}$$

We assume that F(x, y, z) has continuous partial derivatives
$F'_x$, $F'_y$, $F'_z$, at least one of which is non-zero.
But we know from analytic geometry that an equation of the form

$$aa_1 + bb_1 + cc_1 = 0$$

is the condition for two straight lines, one with direction cosines
proportional to a, b, c, and the other with direction cosines
proportional to $a_1$, $b_1$, $c_1$, to be perpendicular. As we have seen, dx, dy, dz
are proportional to the direction cosines of the tangent to the line L at
the point M. Equation (38) thus shows that the tangent to the line L at
the point M is perpendicular to some specific direction, independent of
L, the direction cosines of which are proportional to $F'_x(x, y, z)$, $F'_y(x, y, z)$,
and $F'_z(x, y, z)$. We see that the tangents to every line L, lying on the
surface (37), and passing through the point M, lie in the same plane

$$A(X - x) + B(Y - y) + C(Z - z) = 0, \tag{39}$$

which is referred to as the tangent plane to the surface at the point M.

We know from analytic geometry that the coefficients A, B, C
in the equation of the plane are proportional to the direction cosines
of the normal to the plane, i.e. proportional to $F'_x(x, y, z)$, $F'_y(x, y, z)$,
$F'_z(x, y, z)$ in the present case; so that the equation of the tangent
plane can finally be written as:

$$F'_x(x, y, z)(X - x) + F'_y(x, y, z)(Y - y) + F'_z(x, y, z)(Z - z) = 0, \tag{40}$$

where X, Y, Z are the current coordinates of the tangent plane and
x, y, z are the coordinates of the point of contact M.

The normal to the tangent plane, passing through the point of contact M,
is called the normal to the surface. Its direction cosines are proportional
to the partial derivatives $F'_x(x, y, z)$, $F'_y(x, y, z)$, $F'_z(x, y, z)$, as we have
just seen, and its equation is therefore:

$$\frac{X - x}{F'_x(x, y, z)} = \frac{Y - y}{F'_y(x, y, z)} = \frac{Z - z}{F'_z(x, y, z)}. \tag{41}$$

If the surface is given by the explicit equation z = f(x, y), equation
(37) takes the form:

$$F(x, y, z) = f(x, y) - z = 0,$$

and therefore:

$$F'_x(x, y, z) = f'_x(x, y), \qquad F'_y(x, y, z) = f'_y(x, y), \qquad F'_z(x, y, z) = -1.$$

If we adopt the usual notation of p for $f'_x(x, y)$, and q for $f'_y(x, y)$,
we obtain for the equation of the tangent plane:

$$p(X - x) + q(Y - y) - (Z - z) = 0, \tag{42}$$

and for the normal to the surface:

$$\frac{X - x}{p} = \frac{Y - y}{q} = \frac{Z - z}{-1}. \tag{43}$$








In the case of the ellipsoid

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2} = 1,$$

the equation of the tangent plane at any given point (x, y, z) is:

$$\frac{2x}{a^2}(X - x) + \frac{2y}{b^2}(Y - y) + \frac{2z}{c^2}(Z - z) = 0$$

or

$$\frac{xX}{a^2} + \frac{yY}{b^2} + \frac{zZ}{c^2} = \frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2}.$$

The right-hand side of this equation is simply equal to unity, since the
coordinates (x, y, z) of the point of contact must satisfy the equation of the
ellipsoid; thus the equation of the tangent plane is finally:

$$\frac{xX}{a^2} + \frac{yY}{b^2} + \frac{zZ}{c^2} = 1.$$
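
The passage from (40) to the final form of the tangent plane can be followed numerically. In the minimal sketch below, the function names `tangent_plane` and `grad_F`, the semi-axes and the point of contact are all assumptions chosen for illustration.

```python
def tangent_plane(grad, point):
    """Coefficients (A, B, C, D) of A*X + B*Y + C*Z = D for the tangent plane (40) at `point`."""
    A, B, C = grad(*point)
    x, y, z = point
    return A, B, C, A * x + B * y + C * z


# Hypothetical ellipsoid x^2/a^2 + y^2/b^2 + z^2/c^2 = 1 with semi-axes 1, 2, 3
a, b, c = 1.0, 2.0, 3.0


def grad_F(x, y, z):
    """Partial derivatives of F = x^2/a^2 + y^2/b^2 + z^2/c^2 - 1."""
    return 2 * x / a ** 2, 2 * y / b ** 2, 2 * z / c ** 2


# A point of the surface: x = 0.5, y = 1.0, with z chosen to satisfy the equation
x0, y0 = 0.5, 1.0
z0 = c * (1.0 - x0 ** 2 / a ** 2 - y0 ** 2 / b ** 2) ** 0.5
A, B, C, D = tangent_plane(grad_F, (x0, y0, z0))
```

Since the point lies on the ellipsoid, D comes out equal to 2, and division of the whole equation by 2 gives the form with right-hand side unity. The triple (A, B, C) also gives the direction of the normal (41).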





§ 16. Taylor's formula. Maxima and minima
of functions of several variables


161. Extension of Taylor's formula to functions of several independent
variables. We confine ourselves to the function f(x, y) of two independent
variables, so as to simplify the writing. Taylor's formula gives the
expansion of f(a + h, b + k) in powers of the increments h and k
of the independent variables [127]. We introduce a new independent
variable t by putting:

$$x = a + ht, \qquad y = b + kt. \tag{1}$$

This gives us a function of the single independent variable t:

$$\varphi(t) = f(x, y) = f(a + ht, b + kt),$$

where

$$\varphi(0) = f(a, b) \quad\text{and}\quad \varphi(1) = f(a + h, b + k). \tag{2}$$


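We can write, using Maclaurin’s formula with Lagrange’s remainder
term [127]:

$$\varphi(1) = \varphi(0) + \frac{\varphi'(0)}{1!} + \frac{\varphi''(0)}{2!} + \dots + \frac{\varphi^{(n)}(0)}{n!} + \frac{\varphi^{(n+1)}(\theta)}{(n+1)!} \qquad (0 < \theta < 1). \tag{3}$$

We now express the derivatives $\varphi^{(p)}(0)$ and $\varphi^{(n+1)}(\theta)$ in terms of
the function f(x, y).
From (1), x and y are linear functions of the independent variable t,
and

$$dx = h\,dt, \qquad dy = k\,dt.$$

We can thus use our symbolic formula for finding the differential
of any order of function φ(t) [156]:

$$d^p\varphi(t) = \left(\frac{\partial}{\partial x}\,dx + \frac{\partial}{\partial y}\,dy\right)^{(p)} f(x, y) = \left(h\,\frac{\partial}{\partial x} + k\,\frac{\partial}{\partial y}\right)^{(p)} f(x, y)\,dt^p,$$

whence

$$\varphi^{(p)}(t) = \frac{d^p\varphi(t)}{dt^p} = \left(h\,\frac{\partial}{\partial x} + k\,\frac{\partial}{\partial y}\right)^{(p)} f(x, y).$$

We have, for t = 0, x = a and y = b, and for t = θ, x = a + θh
and y = b + θk, and hence:

$$\varphi^{(p)}(0) = \left(h\,\frac{\partial}{\partial a} + k\,\frac{\partial}{\partial b}\right)^{(p)} f(a, b),$$

$$\varphi^{(n+1)}(\theta) = \left(h\,\frac{\partial}{\partial a} + k\,\frac{\partial}{\partial b}\right)^{(n+1)} f(a + \theta h, b + \theta k).$$

We substitute these expressions in (3), and make use of (2) again,
and finally obtain Taylor's formula:

$$\begin{aligned}
f(a + h, b + k) = f(a, b) &+ \left(h\,\frac{\partial}{\partial a} + k\,\frac{\partial}{\partial b}\right) f(a, b)
 + \frac{1}{2!}\left(h\,\frac{\partial}{\partial a} + k\,\frac{\partial}{\partial b}\right)^{(2)} f(a, b) + \dots \\
&+ \frac{1}{n!}\left(h\,\frac{\partial}{\partial a} + k\,\frac{\partial}{\partial b}\right)^{(n)} f(a, b)
 + \frac{1}{(n+1)!}\left(h\,\frac{\partial}{\partial a} + k\,\frac{\partial}{\partial b}\right)^{(n+1)} f(a + \theta h, b + \theta k).
\end{aligned} \tag{4}$$

If we replace a and b by x and y respectively in this formula, denote
the increments h and k of the independent variables by dx and dy,
and denote the increment of the function, i.e. f(x + dx, y + dy) − f(x, y),
by Δf(x, y), we can write the formula as follows:

$$\Delta f(x, y) = df(x, y) + \frac{d^2 f(x, y)}{2!} + \dots + \frac{d^n f(x, y)}{n!} + \left[\frac{d^{n+1} f(x, y)}{(n+1)!}\right]_{\substack{x + \theta\,dx \\ y + \theta\,dy}}.$$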

The right-hand side of this formula contains the differentials of 
various orders of f(x, y), whilst the values that have to be substituted 
for the independent variables in the (n + 1)th order derivative are 
indicated for the last term. As in the case of a function of a single 
independent variable, we can deduce Maclaurin's formula, giving an 
expansion of f(x, y) in powers of z and y, by setting in Taylor's for- 
mula (4): 

a=0,b=0;h=x, k= y. 


We assumed, when obtaining (4), that f(x, y) has continuous partial 
derivatives up to order (n + 1) in some open domain that contains 
the straight line joining points (a, b) and (a + h, b + k). The point 
x =a + ht, y = b + kt, describes this line as ¢ varies from zero to 
unity. We obtain the formula for finite increments, with n — 0: 


f(a 4- h, b + k) — f(a, b) = hfaa + 0h,b+0k) + 
+ kfi(a 4- 6h,b+ 0 E). 
It immediately follows from this, as in [63], that if the first order 


partial derivatives are everywhere zero in a certain domain, the function 
maintains a constant value in the domain. 
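
The way the symbolic powers in (4) are expanded by the binomial formula can be sketched numerically. In the illustration below, the function name `taylor_polynomial`, the test function f(x, y) = eˣ cos y and the increments are assumptions made for the example; the remainder term is omitted, so the sketch computes only the polynomial part of (4).

```python
from math import comb, cos, exp, factorial, pi


def taylor_polynomial(partial, h, k, n):
    """Sum of the terms of (4) up to order n, without the remainder.

    partial(i, j) must return the derivative of f taken i times in x and
    j times in y, evaluated at the expansion point (a, b).
    """
    total = 0.0
    for m in range(n + 1):
        # (h d/da + k d/db)^(m) f(a, b), expanded by the binomial formula
        term = sum(
            comb(m, j) * h ** (m - j) * k ** j * partial(m - j, j)
            for j in range(m + 1)
        )
        total += term / factorial(m)
    return total


# Hypothetical example: f(x, y) = e^x cos y expanded about (a, b) = (0, 0).
# Each differentiation in y shifts the cosine by pi/2; differentiation in x changes nothing.
a, b = 0.0, 0.0


def partial(i, j):
    return exp(a) * cos(b + j * pi / 2)


h, k = 0.1, 0.05
exact = exp(a + h) * cos(b + k)
errors = [abs(taylor_polynomial(partial, h, k, n) - exact) for n in range(5)]
```

For these small increments the entries of `errors` decrease as n grows, in line with the order of the omitted remainder term.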


162. Necessary conditions for maxima and minima of functions. Let
function f(x, y) be continuous at, and in some neighbourhood of,
the point (a, b). We shall say, by analogy with the case of a single
independent variable, that the function f(x, y) of two independent
variables attains a maximum at the point (a, b), if f(a, b) is not less than
all adjacent values of the function, i.e. if

$$\Delta f = f(a + h, b + k) - f(a, b) \le 0, \tag{5}$$

for all h and k that are sufficiently small in absolute value.
Similarly, we say that f(x, y) attains a minimum at x = a, y = b, if

$$\Delta f = f(a + h, b + k) - f(a, b) \ge 0 \tag{$5_1$}$$

for all h and k sufficiently small in absolute value.

Now, let x = a, y = b be values of the independent variables for
which function f(x, y) attains a maximum or minimum. We consider
the function of a single independent variable f(x, b). By hypothesis,
it must attain a maximum or minimum for x = a, so that its
derivative with respect to x must either vanish or not exist at x = a [58].
Using the same argument, we see that the derivative of f(a, y) with
respect to y must either vanish or not exist at y = b. This leads us
to the following necessary condition for a maximum or minimum:
the function of two independent variables f(x, y) can only attain a
maximum or minimum at values of x and y for which the first order
derivatives ∂f(x, y)/∂x and ∂f(x, y)/∂y either vanish or do not exist.

Similarly, letting either only x or only y vary, we can assert by
what was said in [58] that, given the existence of the second order
derivatives, a necessary condition for a maximum is given by
∂²f(x, y)/∂x² ≤ 0 and ∂²f(x, y)/∂y² ≤ 0, and a necessary condition for
a minimum by ∂²f(x, y)/∂x² ≥ 0 and ∂²f(x, y)/∂y² ≥ 0.

The above reasoning remains applicable for functions of any number 
of independent variables. We can thus state the following general 
rule: 

A function of several independent variables can attain a maximum 
or minimum only for those values of the independent variables for which 
the first order derivatives either vanish or do not exist. We shall in future 
limit ourselves to considering cases when the partial derivatives 
mentioned exist. 

The first order differential is equal to the sum of the products of 
the partial derivatives with respect to the independent variables and 
the differentials of the corresponding independent variables [153], so 
that we can say that the first order differential of a function must 
vanish at the values of the independent variables for which the function 
has a maximum or minimum. This form of necessary condition is
convenient, since the expression for the first differential does not depend
on the choice of variables [153]. By equating the first order partial 
derivatives to zero, we get a system of equations, defining the values 
of the independent variables for which the function can attain maxima 
or minima. A complete solution requires further investigation to find 
out if the function in fact attains maxima or minima at the values 
obtained of the independent variables, and if it does, to say which 
of the two it attains, a maximum or a minimum. We indicate in the 
next section how the investigation proceeds for a function of two 
independent variables. 


163. Investigation of the maxima and minima of a function of two 
independent variables. Let the system of equations 


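$$\frac{\partial f(x, y)}{\partial x} = 0, \qquad \frac{\partial f(x, y)}{\partial y} = 0, \tag{6}$$

expressing the necessary condition for a maximum or minimum, give
us the values x = a, y = b, that are to be investigated. We assume
that f(x, y) has continuous partial derivatives of the second order at
and in some neighbourhood of the point (a, b).

We can write, in accordance with Taylor's formula (4), with n = 1:

$$f(a + h, b + k) = f(a, b) + \frac{\partial f(a, b)}{\partial a}\,h + \frac{\partial f(a, b)}{\partial b}\,k + \frac{1}{2!}\left[\frac{\partial^2 f(x, y)}{\partial x^2}\,h^2 + 2\,\frac{\partial^2 f(x, y)}{\partial x\,\partial y}\,hk + \frac{\partial^2 f(x, y)}{\partial y^2}\,k^2\right]_{\substack{x = a + \theta h \\ y = b + \theta k}}.$$

We use the fact that x = a, y = b represents a solution of system (6),
in order to rewrite the above equation as:

$$\Delta f = f(a + h, b + k) - f(a, b) = \frac{1}{2!}\left[\frac{\partial^2 f(x, y)}{\partial x^2}\,h^2 + 2\,\frac{\partial^2 f(x, y)}{\partial x\,\partial y}\,hk + \frac{\partial^2 f(x, y)}{\partial y^2}\,k^2\right]_{\substack{x = a + \theta h \\ y = b + \theta k}}. \tag{7}$$

We put:

$$r = \sqrt{h^2 + k^2}, \qquad h = r\cos\alpha, \qquad k = r\sin\alpha.$$

When h and k are small in absolute value, r is also small, and vice
versa; the condition that h and k → 0, and the condition that r → 0,
are thus equivalent.

We now have for (7):

$$\Delta f = \frac{r^2}{2}\left[\frac{\partial^2 f(x, y)}{\partial x^2}\cos^2\alpha + 2\,\frac{\partial^2 f(x, y)}{\partial x\,\partial y}\cos\alpha\sin\alpha + \frac{\partial^2 f(x, y)}{\partial y^2}\sin^2\alpha\right]_{\substack{x = a + \theta h \\ y = b + \theta k}}. \tag{8}$$

We can say by the continuity of the second order derivatives, on
taking h and k, or what comes to the same thing, r, as infinitesimals,
that evaluation of the derivatives on the right-hand side of (8) for
a + θh, b + θk, which differ infinitesimally from a, b, gives results
that themselves differ infinitesimally from the numbers:

$$\frac{\partial^2 f(a, b)}{\partial a^2} = A, \qquad \frac{\partial^2 f(a, b)}{\partial a\,\partial b} = B, \qquad \frac{\partial^2 f(a, b)}{\partial b^2} = C;$$

the coefficients of cos² α, cos α sin α, and sin² α in the square brackets
in (8) can therefore be replaced by:

$$A + \varepsilon_1, \qquad 2B + 2\varepsilon_2, \qquad C + \varepsilon_3$$

respectively, where $\varepsilon_1$, $\varepsilon_2$, and $\varepsilon_3$ are infinitesimals along with h and k
(or r).
We can now write (8) in the form:

$$\Delta f = \frac{r^2}{2}\left[A\cos^2\alpha + 2B\sin\alpha\cos\alpha + C\sin^2\alpha + \varepsilon\right], \tag{9}$$

where

$$\varepsilon = \varepsilon_1\cos^2\alpha + 2\varepsilon_2\cos\alpha\sin\alpha + \varepsilon_3\sin^2\alpha$$

is an infinitesimal along with h and k (or r).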

It follows from the definition of a maximum or minimum that, if the 
right-hand side of (9) preserves a (—) sign for all sufficiently small r, 
the values x = a and y = b give a maximum of f(x, y); whilst they 
give a minimum if the (+) sign is preserved. Finally, if the right-hand 
side of (9) can have both a (+) and a (—) sign for arbitrarily small 
values of r, x = a and y = b give neither a maximum nor a minimum 
of the function. 

There are four possible cases when investigating the sign of the 
right-hand side of (9): 

I. If the expression

$$A\cos^2\alpha + 2B\sin\alpha\cos\alpha + C\sin^2\alpha \tag{10}$$

does not vanish for any value of α, it preserves the same sign, since it
is a continuous function of α [55]. Let the sign preserved be (+).
It must attain a least (positive) value in the interval (0, 2π), say m,
which will hold for any α, due to the periodicity of sin α and cos α.
We shall have |ε| less than m for all sufficiently small r, and the sign
of the right-hand side of (9) in this case will be given by the sign of (10),
i.e. will be (+). We thus have a minimum in this case.

II. Now let (10) preserve the (−) sign, whilst not vanishing for
any α. Let −m be the greatest (negative) value of (10), for α varying
in the interval (0, 2π). We have |ε| less than m for sufficiently small r,
in which case the sign of the right-hand side of (9) remains (−),
i.e. we have a maximum here.

III. We now suppose that (10) changes sign. Let it be equal to the
positive number $+m_1$ for $\alpha = \alpha_1$, and equal to the negative number
$-m_2$ for $\alpha = \alpha_2$. We shall have |ε| less than $m_1$ and $m_2$ for all
sufficiently small r, and with these r, and $\alpha = \alpha_1$ or $\alpha_2$, the sign of the
right-hand side of (9) is determined by the sign of (10), i.e. is (+)
for $\alpha = \alpha_1$ and (−) for $\alpha = \alpha_2$. It follows that the sign of the
right-hand side of (9) can here be both (+) and (−) for arbitrarily small r, i.e. we
have neither a maximum nor a minimum in this case.

IV. We finally suppose that (10), whilst preserving an invariable
sign, can vanish for several values of α. Further investigation would
be needed into the sign of ε, for us to be able to draw any conclusions
regarding the sign of the right-hand side of (9); so that this case remains
doubtful in our present study.

All the above leads to investigation of the sign of (10) as α varies,
and we indicate some simple tests that enable us to decide with which
of the four cases we are concerned.

l. We take A z 0 to start with. We can write (10) in the form: 


(A cos a + B sin a)? + (AC — B?) sin? a 1 
A š ( 1) 





If AC — B? > 0, the numerator of fraction (11) consists of the sum 
of two positive terms, which cannot vanish simultaneously. The 
second term vanishes, in fact, only when sin a = 0, in which case 
cosa = +1, and the first term becomes A? 4 0. The sign of (11) is 
thus the same as the sign of A in the present case, so that we have 
case (I) with A > 0, i.e. a minimum, and case (II) with A < 0, 
i.e. a maximum. 

2. We take A ≠ 0 as before, and suppose that $AC - B^2 < 0$.
The numerator of (11) will have the (+) sign with $\sin\alpha = 0$, and
the (−) sign with $\cot\alpha = -B/A$, so that we obtain case (III) in these
circumstances, i.e. neither a maximum nor a minimum.

3. If we take $AC - B^2 = 0$, with A ≠ 0, the numerator of (11)
reduces to the first term; it retains the invariable (+) sign, whilst
vanishing at $\cot\alpha = -B/A$, i.e. we have here the doubtful case (IV).

4. We take A = 0, B ≠ 0. The expression (10) now has the form
$\sin\alpha\,(2B\cos\alpha + C\sin\alpha)$. The expression in brackets preserves
an invariable sign for $\alpha$ close to zero, which is the same as the sign
of B, whereas the first factor, $\sin\alpha$, has different signs depending on
whether $\alpha$ is greater or less than zero. We thus have case (III) here:
neither a maximum nor a minimum.

5. We finally take A = B = 0. Expression (10) now reduces to
the single term $C\sin^2\alpha$. It can therefore vanish, whilst not changing
sign, i.e. we have the doubtful case.

Noticing that $AC - B^2 < 0$ in case 4 above, whilst $AC - B^2 = 0$
in case 5, we are able to state the following rule:

The procedure for finding the maxima and minima of a function
f(x, y) of two independent variables x and y is to obtain the partial
derivatives $f'_x(x, y)$ and $f'_y(x, y)$ and solve the system of equations:

$$f'_x(x, y) = 0, \quad f'_y(x, y) = 0.$$

Let x = a, y = b be a solution of this system. We put

$$\frac{\partial^2 f(a, b)}{\partial a^2} = A, \quad \frac{\partial^2 f(a, b)}{\partial a\,\partial b} = B, \quad \frac{\partial^2 f(a, b)}{\partial b^2} = C,$$

and investigate the solution in accordance with the following
arrangement:

    AC − B²  |  +   |  +   |        −         |       0
    A        |  +   |  −   |                  |
    Result   | min. | max. | no min., no max. | doubtful case
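
The arrangement above can be sketched as a small routine. This is only an illustration of the rule, not part of the original text; the function name `classify_critical_point` and the returned strings are assumptions made for the example, and the arguments are the values A, B, C of the second derivatives at a point where both first derivatives vanish.

```python
def classify_critical_point(A, B, C):
    """Apply the rule of [163] at a stationary point.

    A, B, C are the values of f_xx, f_xy, f_yy at the point.
    Returns "min", "max", "neither" or "doubtful".
    """
    D = A * C - B * B
    if D > 0:
        # A cannot vanish here, because AC > B^2 >= 0.
        return "min" if A > 0 else "max"
    if D < 0:
        return "neither"
    # AC - B^2 = 0: terms of higher order decide, the rule is silent.
    return "doubtful"


if __name__ == "__main__":
    print(classify_critical_point(2, 0, 2))    # f = x^2 + y^2 at the origin
    print(classify_critical_point(2, 0, -2))   # f = x^2 - y^2 at the origin
    print(classify_critical_point(2, -2, 2))   # second-degree part (x - y)^2
```

A "doubtful" answer only means that the second-order terms are insufficient; the examples of [165] show that either outcome is then possible.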





164. Examples. 1. We consider the surface z = f(x, y). The equation of
the tangent plane to it is [160]:

$$p(X - x) + q(Y - y) - (Z - z) = 0,$$

where p and q denote the partial derivatives $f'_x(x, y)$, $f'_y(x, y)$.

If z attains a maximum or minimum for certain values x = a, y = b,
the corresponding point is called a vertex of the surface; the tangent plane at
such a point must be parallel to the XY plane, i.e. the partial derivatives
p and q must vanish, and the surface must be situated on one side of the
tangent plane near the point of contact (Fig. 163). It may happen, however,
that p and q vanish at a certain point, i.e. the tangent plane is parallel to
the XY plane, yet the surface is situated on both sides of the tangent
plane in the neighbourhood of the point; in this case, z attains neither a
maximum nor a minimum for the x, y concerned.

We mention the further possibility that comes under our heading of a
"doubtful case". We suppose that the tangent plane is parallel to the XY
plane at x = a, y = b, and that the surface, whilst lying on one side of the
tangent plane, has a line in common with it, passing through the point of
contact. Here, although the difference

$$f(a + h, b + k) - f(a, b)$$

does not change sign for h and k that are sufficiently small in absolute
value, it vanishes for some non-zero h or k. This case is easily obtained,
as, for instance, with a circular cylinder, the axis of which is parallel
to the XY plane. The function f(x, y) is also spoken of here as having a
maximum or a minimum at x = a, y = b.
Fig. 163

The surface

$$z = \frac{x^2}{2a^2} - \frac{y^2}{2b^2}$$

is a hyperbolic paraboloid. Equating to zero the partial derivatives of z
with respect to x and y gives us x = y = 0, and the tangent plane to the
surface at the origin coincides with the XY plane. We obtain the second
order partial derivatives:

$$\frac{\partial^2 z}{\partial x^2} = \frac{1}{a^2}, \quad \frac{\partial^2 z}{\partial x\,\partial y} = 0, \quad \frac{\partial^2 z}{\partial y^2} = -\frac{1}{b^2},$$

and therefore

$$AC - B^2 = -\frac{1}{a^2 b^2} < 0,$$

i.e. z attains neither a maximum nor a minimum at x = y = 0, and the
surface is situated on both sides of the tangent plane near the origin
(Fig. 164).

2. We are given n points $M_i(a_i, b_i)$ (i = 1, 2, ..., n) on a plane. The
problem is to find the point M such that the sum of the products of certain
given positive numbers $m_i$ and the squares of the distances $MM_i$ is a
minimum.

Let (x, y) be the coordinates of the point M. The sum referred to is:

$$w = \sum_{i=1}^{n} m_i \left[(x - a_i)^2 + (y - b_i)^2\right].$$

Equating the partial derivatives $w'_x$ and $w'_y$ to zero gives:

$$x = \frac{m_1 a_1 + m_2 a_2 + \dots + m_n a_n}{m_1 + m_2 + \dots + m_n}, \quad y = \frac{m_1 b_1 + m_2 b_2 + \dots + m_n b_n}{m_1 + m_2 + \dots + m_n}. \tag{12}$$

It is easily shown that A and $AC - B^2$ are greater than zero here (we have
$A = C = 2(m_1 + \dots + m_n)$ and B = 0), so that the values found for x and
y in fact give a minimum of w. This minimum is the least value of w in the
(x, y) plane, since $w \to +\infty$ as the point (x, y) moves to an infinite
distance.

If $M_i$ are material particles, of masses $m_i$, formula (12) defines the
coordinates of the centre of gravity of the system of particles $M_i$.
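
Formula (12) is easy to check numerically. The following is a minimal sketch, assuming positive weights; the function names and the sample points are invented for the illustration and do not come from the text.

```python
def weighted_sum_of_squares(x, y, points, weights):
    """w = sum of m_i * [(x - a_i)^2 + (y - b_i)^2]."""
    return sum(m * ((x - a) ** 2 + (y - b) ** 2)
               for (a, b), m in zip(points, weights))


def minimizing_point(points, weights):
    """Coordinates given by formula (12); weights are assumed positive."""
    total = sum(weights)
    x = sum(m * a for (a, _), m in zip(points, weights)) / total
    y = sum(m * b for (_, b), m in zip(points, weights)) / total
    return x, y


# Hypothetical data: three points with weights 1, 2, 1.
points = [(0.0, 0.0), (4.0, 0.0), (0.0, 2.0)]
weights = [1.0, 2.0, 1.0]
x0, y0 = minimizing_point(points, weights)
w0 = weighted_sum_of_squares(x0, y0, points, weights)

# Moving away from (x0, y0) in any of these directions increases w.
for dx, dy in [(0.1, 0.0), (-0.1, 0.0), (0.0, 0.1), (0.0, -0.1), (0.07, -0.07)]:
    assert weighted_sum_of_squares(x0 + dx, y0 + dy, points, weights) > w0
```

For these sample data the point found is (2, 0.5), which is the centre of gravity of the three weighted points.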
165. Additional remarks on finding the maxima and minima of a function.
The above discussion can be extended to the case of a larger number of
independent variables. For instance, let a function f(x, y, z) of three
independent variables be given. The values of the independent variables for
which this function attains a maximum or a minimum are found by solving the
system of three equations with three unknowns [162]:

$$f'_x(x, y, z) = 0, \quad f'_y(x, y, z) = 0, \quad f'_z(x, y, z) = 0. \tag{13}$$

Let x = a, y = b, z = c be one of the solutions of this system. We indicate
briefly the way to investigate these values. Taylor's formula gives the
increment of the function as a sum of homogeneous polynomials, arranged in
powers of the increments of the independent variables:


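$$f(a + h, b + k, c + l) - f(a, b, c) = h\frac{\partial f(a, b, c)}{\partial a} + k\frac{\partial f(a, b, c)}{\partial b} + l\frac{\partial f(a, b, c)}{\partial c}$$

$$+ \frac{1}{2!}\left(h\frac{\partial}{\partial a} + k\frac{\partial}{\partial b} + l\frac{\partial}{\partial c}\right)^{(2)} f(a, b, c) + \dots$$

$$+ \frac{1}{(n+1)!}\left(h\frac{\partial}{\partial a} + k\frac{\partial}{\partial b} + l\frac{\partial}{\partial c}\right)^{(n+1)} f(a + \theta h, b + \theta k, c + \theta l) \quad (0 < \theta < 1). \tag{14}$$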
Since x = a, y = b, z = c satisfy equations (13), we have

$$\frac{\partial f(a, b, c)}{\partial a} = \frac{\partial f(a, b, c)}{\partial b} = \frac{\partial f(a, b, c)}{\partial c} = 0.$$


If the combination of terms of the second degree in h, k, l,

$$\frac{1}{2!}\left(h\frac{\partial}{\partial a} + k\frac{\partial}{\partial b} + l\frac{\partial}{\partial c}\right)^{(2)} f(a, b, c) \tag{15}$$

does not vanish, the sign of the right-hand side of (14) is the same as that
of expression (15) for h, k, l sufficiently small in absolute value; if the
sign is (+), f(a, b, c) is a minimum of f(x, y, z), and if (−), a maximum.
If (15) can have different signs, f(a, b, c) is neither a maximum nor a
minimum. If, finally, (15) keeps its sign, but vanishes for some values of
h, k, l, the case is a doubtful one, and further study is needed of the
terms on the right-hand side of equation (14) that contain h, k, l to
higher powers than the second.

We carry out a full investigation of a doubtful case for a particular example
of a function of two independent variables:

$$u = x^2 - 2xy + y^2 + x^3 + y^3.$$

The partial derivatives ∂u/∂x and ∂u/∂y vanish for x = y = 0. We have
further:

$$A = \left.\frac{\partial^2 u}{\partial x^2}\right|_{x=0,\,y=0} = 2, \quad B = \left.\frac{\partial^2 u}{\partial x\,\partial y}\right|_{x=0,\,y=0} = -2, \quad C = \left.\frac{\partial^2 u}{\partial y^2}\right|_{x=0,\,y=0} = 2,$$

$$AC - B^2 = 0,$$

i.e. we are concerned with a doubtful case. It is a characteristic of this case
that the combination of terms of the second degree in the expression for
u consists of a perfect square, and we can write in this particular example:

$$u = (x - y)^2 + (x^3 + y^3).$$

When x = y = 0, u also vanishes. We introduce polar coordinates:

$$x = r\cos\alpha, \quad y = r\sin\alpha,$$

in order to investigate the sign of u for x and y near zero.
Substitution gives us:

$$u = r^2\left[(\cos\alpha - \sin\alpha)^2 + r(\cos^3\alpha + \sin^3\alpha)\right].$$

For any $\alpha$, except $\pi/4$ and $5\pi/4$, in the interval $(0, 2\pi)$,

$$\cos\alpha - \sin\alpha \ne 0,$$

so that we can choose, for such an $\alpha$, a positive number $r_0$ such that
the sign of the expression in square brackets is (+) for $r < r_0$. The sign
is likewise (+) for $\alpha = \pi/4$, but we get a (−) sign for
$\alpha = 5\pi/4$, and u has therefore neither a maximum nor a minimum at
x = y = 0.
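
The change of sign along the line y = x can be seen directly by evaluating u on a small circle round the origin. This is an illustrative sketch only; the radius 0.01 and the helper name `u_on_ray` are arbitrary choices for the example.

```python
import math


def u(x, y):
    return (x - y) ** 2 + x ** 3 + y ** 3


def u_on_ray(r, alpha):
    """Value of u at the point with polar coordinates (r, alpha)."""
    return u(r * math.cos(alpha), r * math.sin(alpha))


r = 1e-2
value_at_pi_over_4 = u_on_ray(r, math.pi / 4)        # square vanishes, cubes are positive
value_at_5pi_over_4 = u_on_ray(r, 5 * math.pi / 4)   # square vanishes, cubes are negative
value_at_0 = u_on_ray(r, 0.0)                        # generic direction: the square dominates

print(value_at_pi_over_4 > 0, value_at_5pi_over_4 < 0, value_at_0 > 0)
```

Since u takes both signs arbitrarily close to the origin, where u = 0, there is neither a maximum nor a minimum there.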
We also consider the function

$$u = (y - x^2)^2 - x^5.$$

It is easily shown that the partial derivatives ∂u/∂x and ∂u/∂y vanish
at x = y = 0, and that we have a doubtful case. We find, on taking x
arbitrarily small and putting $y = x^2$, that u becomes $-x^5$, so that its
sign depends on the sign of x, i.e. u has neither a maximum nor a minimum
for x = y = 0. Introducing polar coordinates would have given:

$$u = r^2\left(\sin^2\alpha - 2r\cos^2\alpha\sin\alpha + r^2\cos^4\alpha - r^3\cos^5\alpha\right),$$

and it is evident from this expression that a positive number $r_0$ can be
found for any $\alpha$, not excluding $\alpha = 0$ and $\pi$, such that u > 0
for $r < r_0$, i.e. u has a (+) sign near the origin on any radius vector
through the origin. But, as we have seen, this does not in itself imply a
minimum at the origin, where u = 0, since it is impossible to find an $r_0$
that is the same for all values of $\alpha$.

We plotted the curve $(y - x^2)^2 - x^5 = 0$ in [76], and saw that it has a
cusp of the second kind at the origin, whilst the left-hand side of this
equation has a (−) sign near the origin, if its value is considered at points
lying inside the shaded region between the two branches of the curve (Fig. 165).
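
A short numerical sketch may make the point about $r_0$ more concrete. It assumes nothing beyond the formula for u; the sample angles and the choice r = α/10 are arbitrary illustrative values. On each ray u is positive close enough to the origin, yet on the parabola y = x² it is negative however close to the origin we go.

```python
import math


def u(x, y):
    return (y - x ** 2) ** 2 - x ** 5


def u_on_ray(r, alpha):
    return u(r * math.cos(alpha), r * math.sin(alpha))


def u_on_parabola(t):
    """Value of u at the point (t, t^2); equals -t^5."""
    return u(t, t ** 2)


for alpha in (0.5, 0.1, 0.01):
    # Close to the origin on the ray of angle alpha, u is positive ...
    assert u_on_ray(alpha / 10, alpha) > 0
    # ... but the same ray meets the parabola y = x^2 at x = tan(alpha),
    # and there u is negative. This point approaches the origin as alpha -> 0,
    # so no single r0 works for all directions.
    assert u_on_parabola(math.tan(alpha)) < 0
```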


166. The greatest and least values of a function. Suppose that we wish 
to find the greatest value (absolute maximum) of some function f(x, y), given 
in a certain domain. The procedure of [163] enables us to find all the maxima 
of the function within this domain, i.e. all the points inside the domain where 
the value of the function is greater than at neighbouring points. Finding 
the greatest value of a function involves taking into account its values on 
the boundary (contour) of the given domain, and comparing these values with 
the maxima inside the domain. The greatest of these values is the greatest 
value of the function in the given domain. The process is similar for finding 
the least value (absolute minimum) of a function in a given domain. The above 
remarks may be better understood with the aid of an example. 

A triangle OAB (Fig. 166) is formed on a plane by the axes OX, OY, and
the line

$$x + y - 1 = 0. \tag{16}$$

The problem is to find the point in the triangle for which the sum of the
squares of its distances from the vertices is a minimum.

On noting that the vertices A and B have coordinates (1, 0) and (0, 1),
we can write down an expression for the above sum of the squares of the
distances of a variable point (x, y) from the vertices of the triangle:

$$z = 2x^2 + 2y^2 + (x - 1)^2 + (y - 1)^2.$$


Equating to zero the first order partial derivatives gives x = y = 1/3,
and these values are easily shown to give a minimum, with z = 4/3. We now
consider z on the contour of the triangle. Taking z on the side OA implies
putting y = 0:

$$z = 2x^2 + (x - 1)^2 + 1,$$

where x can vary in the interval (0, 1). We apply the method of [60] to find
that z has a least value, z = 5/3, at the point C on the side OA, where
x = 1/3. Similarly, z is found to have the least value 5/3 at the point D on
side OB, where y = 1/3. We investigate the value of z on side AB by putting
y = 1 − x, in accordance with equation (16), in the expression for z:

$$z = 3x^2 + 3(x - 1)^2,$$

where x varies in the interval (0, 1). The least value of z here is z = 3/2,
occurring at the point E, where x = y = 1/2. We thus obtain the following
table of possible least values of the function:


    x | 1/3 | 1/3 |  0  | 1/2
    y | 1/3 |  0  | 1/3 | 1/2
    z | 4/3 | 5/3 | 5/3 | 3/2


We see from the table that the least value of z, z = 4/3, is obtained at
the point (1/3, 1/3). A solution of this problem can be obtained for any
triangle, the required point being the centre of gravity of the triangle.
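
The comparison of candidate values can be reproduced with exact fractions. In this sketch the three vertices are added to the list of candidates as an extra precaution (they are the end-points of the intervals on the sides); that addition, and the labels used as dictionary keys, are assumptions of the example rather than part of the text.

```python
from fractions import Fraction as F


def z(x, y):
    """Sum of squared distances from (x, y) to O(0,0), A(1,0), B(0,1)."""
    return 2 * x * x + 2 * y * y + (x - 1) ** 2 + (y - 1) ** 2


candidates = {
    "interior": (F(1, 3), F(1, 3)),
    "C on OA": (F(1, 3), F(0)),
    "D on OB": (F(0), F(1, 3)),
    "E on AB": (F(1, 2), F(1, 2)),
    "vertex O": (F(0), F(0)),
    "vertex A": (F(1), F(0)),
    "vertex B": (F(0), F(1)),
}

values = {name: z(x, y) for name, (x, y) in candidates.items()}
best = min(values, key=values.get)

for name, value in values.items():
    print(name, value)
print("least value at:", best)
```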





167. Conditional maxima and minima. We have so far considered 
the maxima and minima of a function, on the assumption that the 
variables concerned are independent variables. We now turn to 
functions in which the variables are connected by certain relationships, 
in which case, the maxima and minima are referred to as conditional. 

Let it be required to find the maxima and minima of the function

$$f(x_1, x_2, \dots, x_m, x_{m+1}, \dots, x_{m+n})$$

of (m + n) variables $x_i$, connected by n relationships:

$$\varphi_i(x_1, x_2, \dots, x_{m+n}) = 0 \quad (i = 1, 2, \dots, n). \tag{17}$$
We shorten the writing in future by omitting the arguments of the
functions. By solving the n relationships (17) with respect to n
variables, say

$$x_{m+1}, x_{m+2}, \dots, x_{m+n},$$

we can express these in terms of the remaining m independent variables

$$x_1, x_2, \dots, x_m;$$

on substitution in the function f, we now get a function of m independent
variables, i.e. the problem has been reduced to that of finding
the ordinary maxima and minima. The proposed solution of system (17)
is often laborious or indeed impossible in practice, and we indicate
an alternative approach to the problem, the method of Lagrange
multipliers.

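Let the function f attain a conditional maximum, or a conditional
minimum, at the point $M(x_1, x_2, \dots, x_{m+n})$. Having assumed the
existence of the derivatives at the point M, we can say that the total
differential of f must vanish at M [162]:

$$\sum_{s=1}^{m+n} \frac{\partial f}{\partial x_s}\,dx_s = 0. \tag{18}$$

On the other hand, we obtain the following n equations at the same
point M on differentiating relationships (17):

$$\sum_{s=1}^{m+n} \frac{\partial \varphi_i}{\partial x_s}\,dx_s = 0 \quad (i = 1, 2, \dots, n).$$

We multiply these latter equations by the factors

$$\lambda_1, \lambda_2, \dots, \lambda_n,$$

that are so far undefined, and add all these term by term to
relationship (18):

$$\sum_{s=1}^{m+n} \left(\frac{\partial f}{\partial x_s} + \lambda_1\frac{\partial \varphi_1}{\partial x_s} + \lambda_2\frac{\partial \varphi_2}{\partial x_s} + \dots + \lambda_n\frac{\partial \varphi_n}{\partial x_s}\right) dx_s = 0. \tag{19}$$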

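We define these n factors so that each coefficient of the n differentials

$$dx_{m+1}, dx_{m+2}, \dots, dx_{m+n}$$

of the dependent variables vanishes; in other words,
$\lambda_1, \lambda_2, \dots, \lambda_n$ are defined by the n equations:

$$\frac{\partial f}{\partial x_s} + \lambda_1\frac{\partial \varphi_1}{\partial x_s} + \lambda_2\frac{\partial \varphi_2}{\partial x_s} + \dots + \lambda_n\frac{\partial \varphi_n}{\partial x_s} = 0 \quad (s = m+1, m+2, \dots, m+n). \tag{20}$$

The left-hand side of (19) now only includes terms that contain the
differentials of the independent variables

$$dx_1, dx_2, \dots, dx_m,$$

i.e.

$$\sum_{s=1}^{m} \left(\frac{\partial f}{\partial x_s} + \lambda_1\frac{\partial \varphi_1}{\partial x_s} + \lambda_2\frac{\partial \varphi_2}{\partial x_s} + \dots + \lambda_n\frac{\partial \varphi_n}{\partial x_s}\right) dx_s = 0. \tag{21}$$

But the differentials $dx_1, dx_2, \dots, dx_m$ of the independent variables
are arbitrary magnitudes. We can put one of these equal to unity,
and the rest zero, and it is now seen to follow from (21) that all the
coefficients in the equation are zero [158], i.e.

$$\frac{\partial f}{\partial x_s} + \lambda_1\frac{\partial \varphi_1}{\partial x_s} + \lambda_2\frac{\partial \varphi_2}{\partial x_s} + \dots + \lambda_n\frac{\partial \varphi_n}{\partial x_s} = 0 \quad (s = 1, 2, \dots, m). \tag{22}$$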


In all the above formulae, starting with (18), the variables $x_s$ must
be assumed to be replaced by the coordinates of the point M, at
which, by hypothesis, f attains a conditional maximum or minimum.
This applies in particular to equations (20), from which
$\lambda_1, \lambda_2, \dots, \lambda_n$ have to be determined.

Equations (22) and (17) thus express the necessary conditions for
a conditional maximum or minimum to be obtained at the point
$(x_1, x_2, \dots, x_{m+n})$.

Equations (20), (22) and (17) give us (m + 2n) equations for finding the
(m + n) variables $x_s$ and the n factors $\lambda_i$.

It is clear from systems (20) and (22) that the values of $x_s$ for which the
function f attains a conditional maximum or minimum are to be found by
equating to zero the partial derivatives with respect to all $x_s$ of the
function $\Phi$, defined by:

$$\Phi = f + \lambda_1\varphi_1 + \lambda_2\varphi_2 + \dots + \lambda_n\varphi_n,$$

where $\lambda_1, \lambda_2, \dots, \lambda_n$ are considered as constants, and
by taking in addition the n equations (17).
We deal briefly with the problem of sufficient conditions in the
next section.

It is to be noted that, in deducing the above rule, we have assumed
not only the existence of the derivatives of functions f and $\varphi_i$, but
also the possibility of determining the factors
$\lambda_1, \lambda_2, \dots, \lambda_n$ from equations (20). As a matter of
fact, our rule may fail to give us certain values
$(x_1, x_2, \dots, x_{m+n})$ for which a conditional maximum or minimum is
obtained. This will now be explained in more detail, and the theory
made more precise.


168. Supplementary remarks. Suppose that we are examining the conditional
maxima and minima of a function f(x, y) with the auxiliary condition:

$$\varphi(x, y) = 0, \tag{23}$$

and suppose that, for instance, a conditional maximum is found at the point
$(x_0, y_0)$, so that $\varphi(x_0, y_0) = 0$. Let $\varphi(x, y)$ have
continuous first order derivatives at, and in the neighbourhood of,
$(x_0, y_0)$, and suppose further that

$$\varphi'_{y_0}(x_0, y_0) \ne 0. \tag{24}$$


Equation (23) now uniquely defines in the neighbourhood of $x = x_0$ a
function $y = \omega(x)$, which is continuous, has a continuous derivative,
and satisfies $y_0 = \omega(x_0)$ [157]. We can say, on substituting
$y = \omega(x)$ in the function f(x, y), that the function $f[x, \omega(x)]$ of
a single variable x must have a maximum at $x = x_0$, and its total
derivative with respect to x must therefore vanish at $x = x_0$, i.e.

$$f'_{x_0}(x_0, y_0) + f'_{y_0}(x_0, y_0)\,\omega'(x_0) = 0.$$

Substituting $y = \omega(x)$ in (23) and differentiating with respect to x
gives us at $(x_0, y_0)$ [69]:

$$\varphi'_{x_0}(x_0, y_0) + \varphi'_{y_0}(x_0, y_0)\,\omega'(x_0) = 0.$$


We obtain, on multiplying the second equation by $\lambda$ and adding term by
term to the first:

$$(f'_{x_0} + \lambda\varphi'_{x_0}) + (f'_{y_0} + \lambda\varphi'_{y_0})\,\omega'(x_0) = 0.$$

On defining $\lambda$ by the condition $f'_{y_0} + \lambda\varphi'_{y_0} = 0$, as
is possible by (24), we get $f'_{x_0} + \lambda\varphi'_{x_0} = 0$, i.e. we
arrive at the two equations:

$$f'_{x_0} + \lambda\varphi'_{x_0} = 0, \quad f'_{y_0} + \lambda\varphi'_{y_0} = 0, \tag{25}$$

with which the equation $\varphi(x_0, y_0) = 0$ is to be associated; this in
fact verifies the multiplier method. If condition (24) is not satisfied, i.e.
$\varphi'_{y_0}(x_0, y_0) = 0$, but on the other hand
$\varphi'_{x_0}(x_0, y_0) \ne 0$, all the above argument can be repeated,
with the roles of x and y interchanged. If we have at $(x_0, y_0)$:

$$\varphi'_{x_0}(x_0, y_0) = 0 \quad \text{and} \quad \varphi'_{y_0}(x_0, y_0) = 0, \tag{26}$$


we cannot prove that the point $(x_0, y_0)$ is obtained by means of the
method of multipliers.

Equations (26) show that $(x_0, y_0)$ is a singular point of curve (23) [76].
We now give an example of a problem in which condition (26) holds at a
conditional minimum.

Let it be required to find the shortest distance from the point (−1, 0)
to points lying on the semi-cubical parabola $y^2 - x^3 = 0$, illustrated in
Fig. 87 [76]. The minimum is thus required of the function
$f = (x + 1)^2 + y^2$, with the additional condition
$\varphi = y^2 - x^3 = 0$. It is clear geometrically that the minimum is found
at the point (0, 0) of the parabola, which is a singular point of the curve.
The method of multipliers leads us to the following two equations:

$$2(x + 1) - 3\lambda x^2 = 0, \quad 2y + 2\lambda y = 0.$$

On substituting x = 0, y = 0, the first equation leads to the absurdity
2 = 0, whilst the second is satisfied for all $\lambda$. The method of
multipliers does not lead us in this case to the point (0, 0) at which the
conditional minimum is reached. It can be shown similarly that, if a function
has a maximum or a minimum at a point $(x_0, y_0, z_0)$, where there is the
additional condition $\varphi(x, y, z) = 0$, and where at least one of the
first order partial derivatives of $\varphi$ differs from zero at
$(x_0, y_0, z_0)$, this point can be obtained by the method of multipliers.
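
The failure at the singular point can be checked with a few lines of code. The sketch below uses the parametrisation x = t², y = t³ of the semi-cubical parabola, which is an assumption introduced for the example (it is not used in the text), together with hypothetical helper names.

```python
def f(x, y):
    """Squared distance from (-1, 0) to (x, y)."""
    return (x + 1) ** 2 + y ** 2


def point_on_curve(t):
    """x = t^2, y = t^3 satisfies y^2 - x^3 = 0 for every real t."""
    return t * t, t ** 3


def multiplier_residuals(x, y, lam):
    """Left-hand sides of the two multiplier equations."""
    return 2 * (x + 1) - 3 * lam * x * x, 2 * y + 2 * lam * y


# Along the curve, f = (t^2 + 1)^2 + t^6 is least at t = 0, i.e. at (0, 0).
samples = [k / 100 for k in range(-200, 201) if k != 0]
assert all(f(*point_on_curve(t)) > f(0, 0) for t in samples)

# Yet at (0, 0) the first multiplier equation reads 2 = 0 whatever lam is.
for lam in (-10.0, 0.0, 1.0, 1e6):
    first, second = multiplier_residuals(0.0, 0.0, lam)
    assert first == 2 and second == 0
```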

Similar arguments apply in more general cases; but we have now reached
the point of referring to the theorem on the existence of implicit functions
for a system of equations, which we mentioned in [157]. Let the function
f(x, y, z), for example, have a conditional maximum at the point
$(x_0, y_0, z_0)$, with the two additional conditions

$$\varphi(x, y, z) = 0, \quad \psi(x, y, z) = 0, \tag{27}$$

and with the usual assumptions regarding the existence and continuity of
the derivatives; and let us also have

$$\varphi'_{y_0}(x_0, y_0, z_0)\,\psi'_{z_0}(x_0, y_0, z_0) - \varphi'_{z_0}(x_0, y_0, z_0)\,\psi'_{y_0}(x_0, y_0, z_0) \ne 0. \tag{28}$$


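Equations (27) now uniquely define the functions $y = \omega_1(x)$,
$z = \omega_2(x)$, such that $y_0 = \omega_1(x_0)$, $z_0 = \omega_2(x_0)$.
Substitution of these functions in f gives us a function of x only, with a
maximum at $x = x_0$, whence:

$$f'_{x_0}(x_0, y_0, z_0) + f'_{y_0}(x_0, y_0, z_0)\,\omega'_1(x_0) + f'_{z_0}(x_0, y_0, z_0)\,\omega'_2(x_0) = 0.$$

On substituting the same functions in (27) and differentiating with respect
to x at the point $(x_0, y_0, z_0)$, we get:

$$\varphi'_{x_0} + \varphi'_{y_0}\,\omega'_1(x_0) + \varphi'_{z_0}\,\omega'_2(x_0) = 0, \quad \psi'_{x_0} + \psi'_{y_0}\,\omega'_1(x_0) + \psi'_{z_0}\,\omega'_2(x_0) = 0.$$

We multiply these equations by $\lambda_1$, $\lambda_2$ and add to the previous
equation:

$$(f'_{x_0} + \lambda_1\varphi'_{x_0} + \lambda_2\psi'_{x_0}) + (f'_{y_0} + \lambda_1\varphi'_{y_0} + \lambda_2\psi'_{y_0})\,\omega'_1(x_0) + (f'_{z_0} + \lambda_1\varphi'_{z_0} + \lambda_2\psi'_{z_0})\,\omega'_2(x_0) = 0. \tag{29}$$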


We can assert, on taking (28) into account, that $\lambda_1$ and $\lambda_2$
can be defined from the two equations

$$f'_{y_0} + \lambda_1\varphi'_{y_0} + \lambda_2\psi'_{y_0} = 0, \quad f'_{z_0} + \lambda_1\varphi'_{z_0} + \lambda_2\psi'_{z_0} = 0, \tag{30}$$

after which (29) reduces to

$$f'_{x_0} + \lambda_1\varphi'_{x_0} + \lambda_2\psi'_{x_0} = 0, \tag{31}$$

which verifies the method of multipliers in the present case. We must also
take, in addition to equations (30) and (31):

$$\varphi(x_0, y_0, z_0) = 0 \quad \text{and} \quad \psi(x_0, y_0, z_0) = 0.$$


We might have replaced (28) by a similar condition, obtained by
differentiating with respect to $x_0$ and $y_0$, or $x_0$ and $z_0$, instead of
with respect to $y_0$ and $z_0$. But if the expression on the left of (28) is
zero, and the analogous expressions, obtained by differentiating with respect
to $x_0$ and $y_0$, or $x_0$ and $z_0$, are also zero, we are unable to justify
the method of multipliers for the point $(x_0, y_0, z_0)$. It can be shown that
this special case cannot arise in any of the examples of the next section. For
example, we have one auxiliary condition (32) in example 1, and at least one
of the numbers A, B and C differs from zero on the left-hand side of the
condition. If C ≠ 0, for instance, the derivative of the left of (32) with
respect to z is equal to C, and therefore differs from zero at every point
(x, y, z). This indicates that an answer must be obtainable here by the
method of multipliers.

We now briefly discuss the sufficient conditions for a conditional maximum
or minimum, whilst confining ourselves to the case of two independent
variables. Let us look for the conditional maxima and minima of a function
f(x, y, z), with the condition $\varphi(x, y, z) = 0$. We form a new function
$\Phi = f + \lambda\varphi$. We equate the first order partial derivatives with
respect to x, y, z to zero and solve the equations with the added condition,
and we thus obtain, say, $x = x_0$, $y = y_0$, $z = z_0$,
$\lambda = \lambda_0$. We have to investigate the values obtained, i.e. find
the sign of the difference $f(x, y, z) - f(x_0, y_0, z_0)$ for all (x, y, z)
sufficiently near $(x_0, y_0, z_0)$ and satisfying our condition
$\varphi(x, y, z) = 0$. We introduce the function
$\psi(x, y, z) = f(x, y, z) + \lambda_0\varphi(x, y, z)$. It follows
immediately from the equation of the condition that we can take the difference
$\psi(x, y, z) - \psi(x_0, y_0, z_0)$, and investigate its sign, instead of
taking $f(x, y, z) - f(x_0, y_0, z_0)$. The first order partial derivatives of
$\psi$ vanish at $(x_0, y_0, z_0)$ by hypothesis. On expanding the latter
difference by Taylor's formula and confining ourselves to the second
derivatives, we get an expression of the form [cf. 165]:

$$\psi(x, y, z) - \psi(x_0, y_0, z_0) = a_{11}\,dx^2 + a_{22}\,dy^2 + a_{33}\,dz^2 + 2a_{12}\,dx\,dy + 2a_{13}\,dx\,dz + 2a_{23}\,dy\,dz + \dots,$$

where $a_{ik}$ denote the values of the corresponding second order partial
derivatives of $\psi(x, y, z)$ at $(x_0, y_0, z_0)$ (apart from the common
factor 1/2, which does not affect the sign), and dx, dy, dz denote the
increments of the variables. We take $\varphi'_{z_0}(x_0, y_0, z_0) \ne 0$, so
that the equation of the condition defines $z = \omega(x, y)$, where
$z_0 = \omega(x_0, y_0)$. We get from the equation of the condition:

$$\varphi'_x(x, y, z)\,dx + \varphi'_y(x, y, z)\,dy + \varphi'_z(x, y, z)\,dz = 0.$$
On substituting here $x = x_0$, $y = y_0$, $z = z_0$, we can express dz in
terms of dx and dy:

$$dz = -\frac{\varphi'_{x_0}(x_0, y_0, z_0)}{\varphi'_{z_0}(x_0, y_0, z_0)}\,dx - \frac{\varphi'_{y_0}(x_0, y_0, z_0)}{\varphi'_{z_0}(x_0, y_0, z_0)}\,dy.$$

We substitute the expression for dz in the previous formula and collect
similar terms, and obtain:

$$\psi(x, y, z) - \psi(x_0, y_0, z_0) = A\,dx^2 + 2B\,dx\,dy + C\,dy^2 + \dots$$

We can now use the test for maxima and minima of [163]. For instance,
if $AC - B^2 > 0$ and A > 0, f(x, y, z) has a conditional minimum at
$(x_0, y_0, z_0)$. It immediately follows from the discussion of [163] that a
sufficient basis for the rule concerned is the assumption that f(x, y, z) and
$\varphi(x, y, z)$ have continuous derivatives up to the second order at, and
in the neighbourhood of, the point $(x_0, y_0, z_0)$.

We shall not dwell further on the question of sufficient conditions for
conditional maxima and minima. The essential points in the above argument were
the replacement of the difference $f(x, y, z) - f(x_0, y_0, z_0)$ by
$\psi(x, y, z) - \psi(x_0, y_0, z_0)$, the first order derivatives of which
vanish at $(x_0, y_0, z_0)$; and the fact that the differential dz of the
dependent variable was defined in terms of the differentials dx, dy of the
independent variables by an equation of the first degree. A similar approach
must be used when considering the sufficient conditions for any other number
of variables and conditions.


169. Examples. 1. We are required to find the shortest distance from the
point (a, b, c) to the plane

$$Ax + By + Cz + D = 0. \tag{32}$$

The square of the distance from the point (a, b, c) to the variable point
(x, y, z) is given by

$$r^2 = (x - a)^2 + (y - b)^2 + (z - c)^2. \tag{33}$$

The coordinates (x, y, z) must here satisfy equation (32) (the point must
lie on the plane). We find the minimum of expression (33), with the condition
(32). We form the function:

$$\Phi = (x - a)^2 + (y - b)^2 + (z - c)^2 + \lambda_1(Ax + By + Cz + D).$$

Equating to zero its partial derivatives with respect to x, y, z gives:

$$x = a - \tfrac{1}{2}\lambda_1 A, \quad y = b - \tfrac{1}{2}\lambda_1 B, \quad z = c - \tfrac{1}{2}\lambda_1 C. \tag{34}$$

We can find $\lambda_1$ by substituting these expressions in (32):

$$\lambda_1 = \frac{2(Aa + Bb + Cc + D)}{A^2 + B^2 + C^2}. \tag{35}$$

We have obtained unique values for the variables, and these must give
the required least value, since this necessarily exists. Substitution of the
values of (34) in (33) gives us the expression for the square of the distance
from the point to the plane:

$$r_0^2 = \tfrac{1}{4}\lambda_1^2\,(A^2 + B^2 + C^2),$$

where $\lambda_1$ is defined by (35).
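
Formulae (34) and (35) translate directly into a short computation. The sketch below is illustrative only; the function name and the sample plane and point are assumptions made for the example.

```python
def nearest_point_on_plane(A, B, C, D, a, b, c):
    """Foot of the perpendicular from (a, b, c) to Ax + By + Cz + D = 0."""
    norm_sq = A * A + B * B + C * C          # must be non-zero
    lam = 2 * (A * a + B * b + C * c + D) / norm_sq          # formula (35)
    x = a - 0.5 * lam * A                                     # formulae (34)
    y = b - 0.5 * lam * B
    z = c - 0.5 * lam * C
    r0_sq = 0.25 * lam * lam * norm_sq       # squared shortest distance
    return (x, y, z), r0_sq


# Hypothetical data: plane x + 2y + 2z - 3 = 0 and point (4, 5, 6).
(x, y, z), r0_sq = nearest_point_on_plane(1, 2, 2, -3, 4, 5, 6)

# The point found lies on the plane, and r0_sq agrees with expression (33).
assert abs(1 * x + 2 * y + 2 * z - 3) < 1e-12
assert abs(r0_sq - ((x - 4) ** 2 + (y - 5) ** 2 + (z - 6) ** 2)) < 1e-9
```

For these data $Aa + Bb + Cc + D = 23$ and $A^2 + B^2 + C^2 = 9$, so the distance is 23/3.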


2. To express a positive number a as the sum of three positive terms x, y, z,
so as to obtain the greatest value of

$$x^m y^n z^p \tag{36}$$

(m, n, p are given positive numbers). We have to find the maximum of
expression (36) with the condition

$$x + y + z = a. \tag{37}$$

Instead of looking for the maximum of (36), we can look for the maximum
of its logarithm

$$m\log x + n\log y + p\log z.$$

We form the function:

$$\Phi = m\log x + n\log y + p\log z + \lambda_1(x + y + z - a).$$

We obtain, on equating the partial derivatives to zero:

$$x = -\frac{m}{\lambda_1}, \quad y = -\frac{n}{\lambda_1}, \quad z = -\frac{p}{\lambda_1},$$

and (37) now gives:

$$-\frac{m + n + p}{\lambda_1} = a,$$

i.e.

$$x = \frac{ma}{m + n + p}, \quad y = \frac{na}{m + n + p}, \quad z = \frac{pa}{m + n + p}, \tag{38}$$


the values obtained for the variables being positive numbers. It can be shown 
that (36) must have a greatest value with the conditions postulated, and the 
uniqueness of the result shows, as in Example 1, that this greatest value must 
in fact be given by the values obtained for the variables. 

It can be seen from (38) that a has to be split into components proportional 
to the exponents m, n, p, to obtain the solution of the problem. 

We suggest that the reader carry out an investigation of the sufficient 
conditions in the last two examples, by the method outlined in the previous 
section. 
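
The result (38) can be tested numerically on a particular case. This is a rough check under stated assumptions, not a proof: the values a = 12, m = 1, n = 2, p = 3 and the function names are chosen only for the illustration, and the comparison is made on a grid of nearby admissible splits.

```python
def best_split(a, m, n, p):
    """Parts of a proportional to the exponents m, n, p, as in (38)."""
    s = m + n + p
    return m * a / s, n * a / s, p * a / s


def product(x, y, z, m, n, p):
    return x ** m * y ** n * z ** p


a, m, n, p = 12, 1, 2, 3
x0, y0, z0 = best_split(a, m, n, p)
p0 = product(x0, y0, z0, m, n, p)

# Compare with other splits x + y + z = a in which all parts stay positive.
steps = [k / 10 for k in range(-15, 16)]
for dx in steps:
    for dy in steps:
        x, y = x0 + dx, y0 + dy
        z = a - x - y
        assert product(x, y, z, m, n, p) <= p0 + 1e-9
```

Here the split is (2, 4, 6) and the greatest value of the product is 2 · 4² · 6³ = 6912.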

3. A conductor of length $l_0$ branches at one end into k separate conductors
of lengths $l_s$ (s = 1, 2, ..., k), the corresponding currents in the
individual conductors being $i_0, i_1, \dots, i_k$. The problem is to choose
the cross-sectional areas $q_0, q_1, \dots, q_k$ of the conductors so as to
employ the least quantity of material V for a given potential difference E
over the circuits $(l_0, l_1), (l_0, l_2), \dots, (l_0, l_k)$ (Fig. 167).

Let c denote the resistance of a wire of the given material, of unit length
and unit cross-sectional area.

We require the least value of the function V of the variables
$q_0, q_1, \dots, q_k$ given by

$$V = l_0 q_0 + l_1 q_1 + \dots + l_k q_k.$$

Fig. 167


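We can use the given potential difference E to write down k relationships:

$$c\left(\frac{l_0 i_0}{q_0} + \frac{l_s i_s}{q_s}\right) - E = 0 \quad (s = 1, 2, \dots, k). \tag{39}$$

We form the function:

$$\Phi = (l_0 q_0 + l_1 q_1 + \dots + l_k q_k) + \sum_{s=1}^{k} \lambda_s\left[c\left(\frac{l_0 i_0}{q_0} + \frac{l_s i_s}{q_s}\right) - E\right].$$

We equate to zero the partial derivatives of $\Phi$ with respect to
$q_0, q_1, \dots, q_k$:

$$l_0 - \frac{c\,l_0 i_0}{q_0^2}(\lambda_1 + \lambda_2 + \dots + \lambda_k) = 0, \qquad l_s - \frac{\lambda_s\,c\,l_s i_s}{q_s^2} = 0 \quad (s = 1, 2, \dots, k). \tag{40}$$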
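We have from condition (39):

$$\frac{l_1 i_1}{q_1} = \frac{l_2 i_2}{q_2} = \dots = \frac{l_k i_k}{q_k} = \frac{E}{c} - \frac{l_0 i_0}{q_0};$$

if we let $\sigma$ denote the common value of these ratios, we can write:

$$q_s = \frac{l_s i_s}{\sigma} \quad (s = 1, 2, \dots, k), \qquad \sigma = \frac{E}{c} - \frac{l_0 i_0}{q_0}. \tag{41}$$

We obtain from equations (40):

$$\lambda_s = \frac{q_s^2}{c\,i_s} = \frac{l_s^2 i_s}{c\,\sigma^2}.$$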
On substituting these expressions for $\lambda_s$ in the first of equations
(40), we get:

$$q_0^2 = \frac{i_0}{\sigma^2}\left(l_1^2 i_1 + l_2^2 i_2 + \dots + l_k^2 i_k\right),$$

or

$$q_0 = \frac{\sqrt{i_0\left(l_1^2 i_1 + l_2^2 i_2 + \dots + l_k^2 i_k\right)}}{\dfrac{E}{c} - \dfrac{l_0 i_0}{q_0}},$$

whence finally:

$$q_0 = \frac{c}{E}\left[l_0 i_0 + \sqrt{i_0\left(l_1^2 i_1 + l_2^2 i_2 + \dots + l_k^2 i_k\right)}\right].$$

We substitute this expression for $q_0$ in (41) and obtain for
$q_1, q_2, \dots, q_k$:

$$q_s = \frac{c\,l_s i_s}{E}\left[1 + \frac{l_0 i_0}{\sqrt{i_0\left(l_1^2 i_1 + l_2^2 i_2 + \dots + l_k^2 i_k\right)}}\right] \quad (s = 1, 2, \dots, k).$$

The necessary conditions for a maximum or minimum of V are seen to
give us a unique set of positive values for $q_0, q_1, \dots, q_k$, and these
must represent the solution of our problem, since it is evident from physical
considerations that a minimum quantity of material must be obtained for a
certain choice of the cross-sectional areas.
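
The solution can be checked numerically. The sketch below assumes the reading of the formulae given above (V as the sum of the products $l_s q_s$, conditions (39), and the final expression for $q_0$); the symbols c and E, the function names and the sample data are assumptions of the illustration. Conditions (39) leave only $q_0$ free, so V can be treated as a function of $q_0$ alone.

```python
import math


def optimal_sections(c, E, l0, i0, lengths, currents):
    """q0 from the final formula, then q_s from (41)."""
    S = sum(l * l * i for l, i in zip(lengths, currents))
    q0 = (c / E) * (l0 * i0 + math.sqrt(i0 * S))
    sigma = E / c - l0 * i0 / q0
    qs = [l * i / sigma for l, i in zip(lengths, currents)]
    return q0, qs


def volume_given_q0(q0, c, E, l0, i0, lengths, currents):
    """V when q_1..q_k are fixed by conditions (39) for the chosen q0."""
    sigma = E / c - l0 * i0 / q0             # must stay positive
    return l0 * q0 + sum(l * l * i for l, i in zip(lengths, currents)) / sigma


# Hypothetical data: one main conductor feeding two branches.
c, E, l0, i0 = 1.0, 10.0, 2.0, 3.0
lengths, currents = [1.0, 2.0], [1.0, 2.0]
q0, qs = optimal_sections(c, E, l0, i0, lengths, currents)

# Conditions (39) hold for every branch.
for l, i, q in zip(lengths, currents, qs):
    assert abs(c * (l0 * i0 / q0 + l * i / q) - E) < 1e-9

# Nearby admissible choices of q0 need more material.
v0 = volume_given_q0(q0, c, E, l0, i0, lengths, currents)
assert volume_given_q0(0.99 * q0, c, E, l0, i0, lengths, currents) > v0
assert volume_given_q0(1.01 * q0, c, E, l0, i0, lengths, currents) > v0
```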


EXERCISES ON CHAPTER V 


1. Find f(1/2, 3), f(1, −1) for f(x, y) = xy + x/y.


2. Find f(y,z), f(—=, —y) fey), MfG y) for fv. y) = (55). 


3. Find the value of f(x, y) = 1 + x − y at points on the parabola
y = x².

4. Find the value of z = (x⁴ + 2x²y² + y⁴)/(1 − x² − y²) at points
on the circle x² + y² = R².

5. Define f(x) if $f(y/x) = y^{-1}(x^2 + y^2)^{1/2}$, y > 0.

6. Find f(x, y) if f(x + y, x − y) = xy + y.

7. Let z = √y + f(x − 1). Determine the functions f and z if
z = x when y = 1.

8. Let z = x f(y/x). Determine the functions f and z if z = √(1 + y²)
when x = 1.

9. Find and represent the domains of existence of the following 
functions: 
(a) z = Y(3Y —a* — y}; (b) 2 = 1 + Y =(2— y); (c) 2 = log(z+y); 
(d) z = x + are cos y; (e) z = /(1— 2) + Y(1 — y); 
(f) 2 = aro sin (y/x); (g) 2 = V(x* — 4) + V — y*); 
(h) 2=V(* Ey — a) (2? — 3$ — y5] (a > 0); 
(i) z =/ (y sin 2): (j)z =1og(a* + y); 
AY 
(k) z = are tan Tree l 
() 2= (2 + y) 8 (m) z = (y — a”; 
1 O a ONT 
(n) 2===3 + Y (0) z = V[sin(z? + y]. 
Find the partial derivatives of the functions 10-15: 
10. u = (xy). 11. v = 27, 12. u = x? + y? + 22 — Bayz. 
13. u = y t(l — 22). 14. u = r^), where r = V (x? + y? + 22). 
15. u = ayz|(w + y) (2 + 1) 
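16. Find $f'_x$, $f'_y$, $f'_z$ at the point (1, 2, 0) for the function
f(x, y, z) = log(xy + z).
17. Calculate the value of the determinant

$$\begin{vmatrix} \dfrac{\partial x}{\partial r} & \dfrac{\partial x}{\partial \theta} & \dfrac{\partial x}{\partial \varphi} \\[2mm] \dfrac{\partial y}{\partial r} & \dfrac{\partial y}{\partial \theta} & \dfrac{\partial y}{\partial \varphi} \\[2mm] \dfrac{\partial z}{\partial r} & \dfrac{\partial z}{\partial \theta} & \dfrac{\partial z}{\partial \varphi} \end{vmatrix}$$

for the functions x = r cos φ sin θ, y = r sin φ sin θ, z = r cos θ.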
Verify Euler's theorem (on homogeneous functions) for the functions 
18-23: 
18. 2 = ax? + 2hxy + by? 19. z = x[(x? + y’). 
20. z = (£ + y) (a? + y’. 21. z = log(y/x). 
22. 2 = log(w + y) — log(x +2). 23. 2 = xy sin(x]y). 
24. Show that if z = log(2? + xy + y?), 
Oz 


217: 
25. Show that if z = zy + xe”, 
0 0: 
ras Hyg = ty +2. 
26. Show that if u = (x — y) (y — 2) (2 — x), then 
d. + dy Y 2. 
27. Show that if u = x + (x — y)/(y — 2), 
ve A eT 1. 
28. Find 2* and & if z == arc tan ¥ and y = ed 


29. Find 2 or z and E dz £2 = 2” where y = g(z). 
30. Find 2 and 9 ifo = f(u, v), where u = 2? — y?, y = e”. 


dy 
. Oz OZ op y u . u 
31. Find 5, and gz if z = arc tan z where x = u sin v, Y = u cos v. 
32. Find z and jie = f(u), where u = ry + yjz. 
33. Prove that if u = D(a? + y? + 2), where x=rcosqsin 0, 
y = r sin p sin 0, z = r cos 0, then 
du Ou 


34. Prove that if 2 = f(x + ay), where f is a differentiable function, 
then 
Oz —— a Oz 
CR 


35. Prove that the function w = f(u, v) where u = x + at, v = y + bt, 
satisfies the equation 

ðw _ Ow Ow 

a T s +6 Oy ` 
Oz Oz Oz 
Oat’ OrOy' Gy? 
. q 0% 0% 0m . 
37. Find 53, roy! Oy if z = log (a? + y). 


IN o? . LL —— — —— 
38. Find aa 5j if z= /2xy + y?. 





36. Find if z = o(a2/a? + y?/b?)u2, 





Om 0% 
oxy Oy ox" 








39. Prove that for the function z = arc sin |/(x — y)/z, 


2 2 
40. Prove that for the function z = 2” P = eu 





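41. Prove that for the function defined by the equation

$$f(x, y) = \begin{cases} \dfrac{xy(x^2 - y^2)}{x^2 + y^2} & \text{if } (x, y) \ne (0, 0), \\ 0 & \text{at } (0, 0), \end{cases}$$

$f''_{xy}(0, 0) = -1$, $f''_{yx}(0, 0) = +1$.
42. Prove that the function u = log r, where $r = \sqrt{(x - a)^2 + (y - b)^2}$,
satisfies the two-dimensional Laplace equation

$$\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} = 0.$$

43. Prove that the function

$$u(x, t) = A\sin(a\lambda t + \varphi)\sin\lambda x,$$

where A, λ, φ, a are constants, satisfies the one-dimensional wave
equation

$$\frac{\partial^2 u}{\partial t^2} = a^2\frac{\partial^2 u}{\partial x^2}.$$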
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Investigate the maxima and minima of the functions 44-52: 
44. $(x - 1)^2 + 2y^2$. 45. $(x - 1)^2 - 2y^2$. 46. $x^2 + xy + y^2 - 2x - y$.
47. $x^3y^2(6 - x - y)$, $x > 0$, $y > 0$. 48. $x^4 + y^4 - 2x^2 + 4xy - 2y^2$.
49. $xy\sqrt{1 - x^2/a^2 - y^2/b^2}$. 50. $1 - (x^2 + y^2)^{2/3}$.
51. $(x^2 + y^2)\,e^{-(x^2 + y^2)}$. 52. $(1 + x - y)/\sqrt{1 + x^2 + y^2}$.
53. Find the turning values of the functions defined by the equations
(a) $x^2 + y^2 + z^2 - 2x + 4y - 6z - 11 = 0$;
(b) $x^3 - y^2 - 3x + 4y + z^2 + z - 8 = 0$.
Determine the conditional extrema of the functions 54-59:
54. $z = xy$ with $x + y = 1$. 55. $z = x + 2y$ with $x^2 + y^2 = 5$.


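56. $z = x^2 + y^2$ with $\dfrac{x}{2} + \dfrac{y}{3} = 1$.

57. $z = \cos^2 x + \cos^2 y$ with $y - x = \tfrac{1}{4}\pi$.

58. $u = x - 2y + 2z$ with $x^2 + y^2 + z^2 = 9$.
59. $u = x^2 + y^2 + z^2$ with $\dfrac{x^2}{a^2} + \dfrac{y^2}{b^2} + \dfrac{z^2}{c^2} = 1$, $(a > b > c > 0)$.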


60. From all rectangular parallelepipeds of prescribed volume V, 
find the one which has minimum surface area. 

61. If an open rectangular box has prescribed volume V, find the 
dimensions if its surface area is to be the least possible. 

62. Find the shape of the triangle with prescribed perimeter 2p 
enclosing the maximum area. 

63. Find the shape of the rectangular parallelepiped of prescribed
surface area S enclosing the maximum volume.

64. Find the triangle of prescribed perimeter 2p, which when rotated 
about one of its sides generates a solid with the greatest possible 
volume. 

65. Masses of amount $m_1$, $m_2$, $m_3$ are placed at the points $P_1(x_1, y_1)$,
$P_2(x_2, y_2)$, $P_3(x_3, y_3)$. Find the position of the point P such that
the moment of inertia of these masses relative to P is a minimum.

66. Find the equation of the plane through the point (a, b, c) which 
together with the coordinate planes forms a tetrahedron of mini- 
mum volume. 

67. Find the dimensions of the rectangular parallelepiped of maximum 
volume which can be inscribed in an ellipsoid. 


CHAPTER VI 


COMPLEX NUMBERS. 
THE FOUNDATIONS OF HIGHER ALGEBRA. 
INTEGRATION OF VARIOUS FUNCTIONS 


§ 17. Complex numbers


170. Complex numbers. As we know, the operation of extracting 
a root is not always possible if we confine ourselves to real numbers; 
the root of even degree of a negative number has no meaning in the 
domain of real numbers; and moreover, a quadratic equation with 
real coefficients does not always possess real roots. This situation leads 
naturally to a broadening of the concept of number by means of the 
introduction of new numbers of a more general kind, with real numbers 
appearing as a particular case. These new numbers, and the operations 
on them, must be defined in such a way that all the basic arithmetical 
laws that are known for real numbers remain valid. The fact that 
such a definition is possible is established below. 

Apart from the impossibility just mentioned of extracting a root 
in certain cases, simple geometrical considerations also lead to a 
natural extension of the concept of number. It is the geometrical 
approach to the problem that we shall choose. 

We know that every real number can be represented graphically 
either by a segment, cut off from a given axis OX, or else by a point 
on this axis, if we agree to locate the initial point of all segments at 
the origin of coordinates. Conversely, a definite real number corres- 
ponds to every segment or point on the axis OX. 

If we now consider the whole plane, in reference to the coordinate 
axes OX, OY, instead of taking the single axis OX, suitable generali- 
zation of the concept of number will enable us to associate a certain 
number, which we call complex, with every vector, or with every point, 
lying on this plane. 

If we agree not to distinguish between vectors of the same length
and the same direction, a real number can be associated not only with 
every vector along OX, but with every vector in general that is parallel 
to OX. The real number one, in particular, corresponds to a vector 
of unit length in the positive direction of OX. 

We associate the symbol i, called imaginary unity, with a vector 
of unit length in the positive direction of the axis OY. Every vector 
MN in the plane can be represented as the sum of two vectors MP 
and PN, parallel to the coordinate axes (Fig. 168). Some real number 
a corresponds to the vector MP, parallel to OX. Let the symbol bi
correspond to the vector PN, parallel to OY, where b is a real number
which is equal in absolute value to the length of PN, and which is
positive or negative according as the direction of PN coincides with
the positive or negative direction of OY. We now naturally associate
with the vector MN a complex number, of the form

a + bi.


We note the fact that the (+) sign in the expression written, a + bi, 
is not the sign of an operation. The expression is to be viewed as a 
single symbol for denoting the complex number. We shall give our 
attention to the sign after defining addition of complex numbers. 

The real numbers a and b are evidently the magnitudes of the
projections of the vector MN on the coordinate axes.

We mark off the vector OA from the origin of coordinates (Fig. 168),
with the same length and direction as MN. The end A of the vector
will have coordinates (a, b), and we can associate the same complex
number a + bi with the point A as with vectors MN and OA.

Thus, a definite complex number a + bi corresponds to every vector 
in the plane (to every point of the plane). The real numbers a and b 
are equal to the projections of the vector on the coordinate axes (to the 
coordinates of the point). 

We obtain the set of complex numbers on assigning all possible 
real values to the symbols a and b in the expression a + bi. We call 
a the real part, and b the imaginary part, of the complex number. 

In the particular case of a vector parallel to OX, the complex 
number coincides with its real part: 


a + 0i = a. (1)

We look on the real number a as a particular case of a complex
number, in accordance with formula (1). 

The concept of the equality of two complex numbers follows from 
the geometrical interpretation. Two vectors are looked on as equal 
if they have the same length and the same direction, i.e. if they have 
the same projections on the coordinate axes; hence, two complex 
numbers are considered to be equal when, and only when, their real parts 
and their imaginary parts are separately equal, the condition for equality 
of complex numbers being : 


$a_1 + b_1 i = a_2 + b_2 i$ implies $a_1 = a_2$, $b_1 = b_2$. (2)


In particular, a + bi = 0 implies a = 0, b = 0.

Instead of defining the vector MN by its projections a and b on
the axes, we can use two other magnitudes: its length r and the angle
$\varphi$ that the direction MN makes with the positive direction of OX
(Fig. 168). If we take the complex number a + bi as corresponding to
the point with coordinates (a, b), r and $\varphi$ are evidently the
polar coordinates of the point. We know that

$$a = r\cos\varphi; \quad b = r\sin\varphi;$$
$$r = \sqrt{a^2 + b^2}; \quad \cos\varphi = \frac{a}{\sqrt{a^2 + b^2}}; \quad \sin\varphi = \frac{b}{\sqrt{a^2 + b^2}}; \quad \varphi = \arctan\frac{b}{a}.$$ (3)

[Fig. 168]


The positive number r is called the modulus, and $\varphi$ the amplitude,
of the complex number a + bi. The amplitude is only defined to an
accuracy of plus or minus a multiple of $2\pi$, since any vector MN
remains unchanged on rotation any integral number of times in either
direction about the point M. When r = 0, the complex number is
equal to zero, and its amplitude is completely indeterminate. The
condition for equality of two complex numbers evidently consists in
the fact that their moduli must be equal, and their amplitudes must
only differ by some multiple of $2\pi$.

A real number has the amplitude $2k\pi$ if it is positive, and $(2k + 1)\pi$
if it is negative, where k is any integer. If the real part of a complex
number is zero, the complex number has the form bi and is said to be
pure imaginary. The vector corresponding to such a number is
parallel to OY, whilst the amplitude of bi is equal to $(\pi/2 + 2k\pi)$
if b > 0, and $(3\pi/2 + 2k\pi)$ if b < 0.

The modulus of a real number is the same as its absolute value. 
The modulus of the number a + bi is denoted by writing the number 
between two vertical strokes: 


$$|a + bi| = \sqrt{a^2 + b^2}.$$


In future, we shall often denote a complex number by a single letter.
If $\alpha$ is the complex number, its modulus is denoted by $|\alpha|$. We can
utilize expressions (3) for a and b to express a complex number in terms
of its modulus and amplitude, in the form:


$$r(\cos\varphi + i\sin\varphi).$$


The complex number is said to be expressed in trigonometric form 
in this case. 
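
The passage between the forms a + bi and $r(\cos\varphi + i\sin\varphi)$ can be tried out numerically. The following is only a sketch: the function names `to_trig_form` and `from_trig_form` are invented for this illustration, and `math.atan2` is used in place of a bare arc tangent because arc tan (b/a) by itself fixes the amplitude only up to a multiple of $\pi$, whereas `atan2` also takes the signs of a and b into account.

```python
import math

def to_trig_form(a, b):
    """Return (r, phi) with a + bi = r(cos phi + i sin phi), phi in (-pi, pi]."""
    r = math.hypot(a, b)      # modulus: sqrt(a^2 + b^2)
    phi = math.atan2(b, a)    # one value of the amplitude, in the right quadrant
    return r, phi

def from_trig_form(r, phi):
    """Return the complex number r(cos phi + i sin phi)."""
    return complex(r * math.cos(phi), r * math.sin(phi))

r, phi = to_trig_form(-1.0, 1.0)                   # the point (-1, 1)
z = from_trig_form(r, phi)                         # back to a + bi
z_shifted = from_trig_form(r, phi + 2 * math.pi)   # amplitude changed by 2*pi
```

Changing the amplitude by $2\pi$ gives the same number again (up to rounding), which is the indeterminacy of the amplitude mentioned above.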


171. Addition and subtraction of complex numbers. A vector sum 
consists of the closing side of a polygon made up of the added vectors. 
Since the projection of the closing side is equal to the sum of the pro- 
jections of the components, we arrive at the following definition of 
addition of complex numbers : 


$$(a_1 + b_1 i) + (a_2 + b_2 i) + \ldots + (a_n + b_n i) = (a_1 + a_2 + \ldots + a_n) + (b_1 + b_2 + \ldots + b_n)\, i.$$ (4)


It is easily shown that a sum of complex numbers is independent of
the order of the terms (law of transposition), and that the terms can
be combined in groups (law of association), since these properties
belong to the sums of the real numbers $a_k$ and $b_k$.

As was mentioned above, the complex number a + 0i is identical
with the real number a. Similarly, the number 0 + bi can simply
be written as bi (a pure imaginary number). We can use the definition
of addition to assert that the complex number a + bi is the
sum of the real number a and the pure imaginary number bi, i.e.

$$a + bi = (a + 0i) + (0 + bi).$$

Subtraction is defined as the converse operation to addition, i.e.
the difference
$$x + yi = (a_1 + b_1 i) - (a_2 + b_2 i)$$
is defined by the condition
$$(x + yi) + (a_2 + b_2 i) = a_1 + b_1 i,$$
or, by (4) and (2): $x + a_2 = a_1$; $y + b_2 = b_1$, i.e. $x = a_1 - a_2$,
$y = b_1 - b_2$, which finally gives
$$(a_1 + b_1 i) - (a_2 + b_2 i) = (a_1 - a_2) + (b_1 - b_2)\, i.$$ (5)
Subtraction of the complex number $(a_2 + b_2 i)$ from $(a_1 + b_1 i)$ is
seen to be equivalent to addition to the latter of the complex number
$(-a_2 - b_2 i)$. This corresponds to the following: subtraction of a vector
from a second vector is equivalent to the addition to the second vector of
a vector of equal magnitude but opposite direction to the first.

[Fig. 169] [Fig. 170]


We consider the vector $A_1A_2$, with its initial point $A_1$ corresponding
to the complex number $a_1 + b_1 i$ and its terminal point $A_2$ corresponding
to $a_2 + b_2 i$. This vector is evidently the difference
of vectors $OA_2$ and $OA_1$ (Fig. 169), and its corresponding complex
number is therefore

$$(a_2 - a_1) + (b_2 - b_1)\, i,$$
equal to the difference between the complex numbers corresponding
to its terminal and initial points.

We now establish the properties of the moduli of the sum and 
difference of two complex numbers. Since the modulus of a complex 
number is equal to the length of the corresponding vector, whilst one 
side of a triangle is less than the sum of the other two sides, we have 
(Fig. 170): 

$$|\alpha_1 + \alpha_2| \leqslant |\alpha_1| + |\alpha_2|,$$
where the sign of equality only occurs when the vectors corresponding
to $\alpha_1$ and $\alpha_2$ have the same direction, i.e. when the numbers have
amplitudes that are either equal or differ by a multiple of $2\pi$. This
property is evidently valid for any number of terms:


$$|\alpha_1 + \alpha_2 + \ldots + \alpha_n| \leqslant |\alpha_1| + |\alpha_2| + \ldots + |\alpha_n|,$$


i.e. the modulus of a sum is less than or equal to the sum of the moduli
of the terms, the sign of equality only occurring when the amplitudes of
the terms are equal or differ by a multiple of $2\pi$.

Since one side of a triangle is greater than the difference of the two 
remaining sides, we can also write: 


$$|\alpha_1 + \alpha_2| \geqslant |\alpha_1| - |\alpha_2|,$$


i.e. the modulus of the sum of two terms is greater than or equal to the 
difference of the moduli of the terms. Equality will only occur when the 
corresponding vectors have opposite directions. 

The subtraction of vectors and complex numbers leads to an addi- 
tion, as we saw above, and we have for the modulus of the difference 
of two complex numbers, as for the modulus of the sum (Fig. 170): 


$$|\alpha_1| - |\alpha_2| \leqslant |\alpha_1 - \alpha_2| \leqslant |\alpha_1| + |\alpha_2|.$$
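
The two-sided estimate for the modulus of a sum can be checked on a few sample numbers. This is a small sketch using Python's built-in complex type; the helper name `modulus_bounds` and the sample values are assumptions made only for this illustration.

```python
def modulus_bounds(alpha1, alpha2):
    """Return (lower, actual, upper) for the modulus of alpha1 + alpha2."""
    lower = abs(alpha1) - abs(alpha2)   # difference of the moduli
    upper = abs(alpha1) + abs(alpha2)   # sum of the moduli
    return lower, abs(alpha1 + alpha2), upper

# A generic pair: both inequalities are strict.
generic = modulus_bounds(3 + 4j, 1 - 2j)

# Vectors with the same direction: the upper bound is attained.
same_direction = modulus_bounds(2 + 2j, 1 + 1j)

# Vectors with opposite directions: the lower bound is attained.
opposite = modulus_bounds(3 + 3j, -1 - 1j)
```

The three cases correspond to a proper triangle and to the two degenerate triangles in which the vectors lie along one line.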


172. Multiplication of complex numbers. The definition of the 
product of two complex numbers is similar to the definition of the 
product of two real numbers, which is: the product is taken to be the 
number composed of the multiplicand, as the multiplier is composed 
of unity. The vector corresponding to the complex number with
modulus r and amplitude $\varphi$ can be obtained from the unit vector,
with unit length and directed in the positive direction of OX, by
increasing its length r times and rotating it positively through an
angle $\varphi$.

The product of a vector $\alpha_1$ and a vector $\alpha_2$ is defined as the vector
obtained by applying the same elongation and the same rotation
to $\alpha_1$ as are applied to the unit vector in order to obtain $\alpha_2$. The unit
vector here obviously corresponds to real unity.

If $(r_1, \varphi_1)$, $(r_2, \varphi_2)$ are the moduli and amplitudes of the complex
numbers corresponding to vectors $\alpha_1$ and $\alpha_2$, the complex number
corresponding to the product of these vectors will clearly have modulus
$r_1 r_2$ and amplitude $(\varphi_1 + \varphi_2)$. We thus arrive at the following
definition of the product of complex numbers:

The product of two complex numbers is defined as the complex number
the modulus of which is equal to the product of the moduli of the factors
and the amplitude of which is the sum of the amplitudes of the factors.

When the complex numbers are given in trigonometric form, 
we have: 


$$r_1(\cos\varphi_1 + i\sin\varphi_1)\cdot r_2(\cos\varphi_2 + i\sin\varphi_2) = r_1 r_2[\cos(\varphi_1 + \varphi_2) + i\sin(\varphi_1 + \varphi_2)].$$ (6)


We now deduce the rule for finding the product of two complex 
numbers that are not given in trigonometric form: 


$$(a_1 + b_1 i)(a_2 + b_2 i) = x + yi.$$


We can write, using the above notation for the moduli and the
amplitudes of the factors:


$$a_1 = r_1\cos\varphi_1; \quad b_1 = r_1\sin\varphi_1; \quad a_2 = r_2\cos\varphi_2; \quad b_2 = r_2\sin\varphi_2,$$
and in accordance with the definition (6) of the product:


$$x = r_1 r_2\cos(\varphi_1 + \varphi_2); \quad y = r_1 r_2\sin(\varphi_1 + \varphi_2),$$
whence:


$$x = r_1 r_2(\cos\varphi_1\cos\varphi_2 - \sin\varphi_1\sin\varphi_2) = r_1\cos\varphi_1\cdot r_2\cos\varphi_2 - r_1\sin\varphi_1\cdot r_2\sin\varphi_2 = a_1 a_2 - b_1 b_2,$$
$$y = r_1 r_2(\sin\varphi_1\cos\varphi_2 + \cos\varphi_1\sin\varphi_2) = r_1\sin\varphi_1\cdot r_2\cos\varphi_2 + r_1\cos\varphi_1\cdot r_2\sin\varphi_2 = b_1 a_2 + a_1 b_2,$$
so that we finally get:
$$(a_1 + b_1 i)(a_2 + b_2 i) = (a_1 a_2 - b_1 b_2) + (b_1 a_2 + a_1 b_2)\, i.$$ (7)
When $b_1 = b_2 = 0$, the factors are the real numbers $a_1$ and $a_2$
and the product reduces to the product of these numbers.
When $a_1 = a_2 = 0$, and $b_1 = b_2 = 1$, equation (7) gives:
$$i\cdot i = i^2 = -1,$$
i.e. the square of imaginary unity is equal to $(-1)$.
Evaluation of positive integral powers of i thus gives:


$$i^2 = -1, \quad i^3 = -i, \quad i^4 = 1, \quad i^5 = i, \quad i^6 = -1, \ldots$$








and in general, for any positive integer k:


$$i^{4k} = 1, \quad i^{4k+1} = i, \quad i^{4k+2} = -1, \quad i^{4k+3} = -i.$$

The multiplication rule expressed by (7) can be stated as: complex
numbers are to be multiplied algebraically, taking $i^2 = -1$.
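
As an illustration, rule (7) can be compared with the definition (6) by moduli and amplitudes. The sketch below is not part of the text: the function names are invented here, and the numbers are represented simply as pairs (real part, imaginary part).

```python
import math

def multiply_rule7(a1, b1, a2, b2):
    """Product (a1 + b1 i)(a2 + b2 i) by rule (7), returned as the pair (x, y)."""
    return a1 * a2 - b1 * b2, b1 * a2 + a1 * b2

def multiply_polar(a1, b1, a2, b2):
    """The same product by definition (6): multiply moduli, add amplitudes."""
    r = math.hypot(a1, b1) * math.hypot(a2, b2)
    phi = math.atan2(b1, a1) + math.atan2(b2, a2)
    return r * math.cos(phi), r * math.sin(phi)

def power_of_i(k):
    """i**k for an integer k >= 0, using the period-4 pattern of the powers of i."""
    return [1, 1j, -1, -1j][k % 4]

x, y = multiply_rule7(1, 2, 3, -4)      # (1 + 2i)(3 - 4i)
xp, yp = multiply_polar(1, 2, 3, -4)    # agrees with (x, y) up to rounding
```

The algebraic rule works with exact arithmetic on the parts, whereas the polar route passes through trigonometric functions and therefore agrees only up to rounding error.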

If $\alpha$ is the complex number a + bi, the complex number a − bi
is called the conjugate of $\alpha$, and is denoted by $\bar\alpha$.

In accordance with (3):


$$|\alpha|^2 = a^2 + b^2.$$
But from (7):
$$(a + bi)(a - bi) = a^2 + b^2,$$
so that:
$$|\alpha|^2 = (a + bi)(a - bi) = \alpha\bar\alpha,$$


i.e. the product of conjugate complex numbers is equal to the square of
the modulus of each.
We also note the obvious expressions:


$$\alpha + \bar\alpha = 2a; \quad \alpha - \bar\alpha = 2bi.$$ (8)


It immediately follows from (4) and (7) that addition and multiplication
of complex numbers obey the commutative law, i.e. a sum
is independent of the order of its terms, and a product is independent
of the order of its factors. It is easy to verify the associative and
distributive laws, expressed by the identities:


$$(\alpha_1 + \alpha_2) + \alpha_3 = \alpha_1 + (\alpha_2 + \alpha_3); \quad (\alpha_1\alpha_2)\,\alpha_3 = \alpha_1(\alpha_2\alpha_3);$$
$$(\alpha_1 + \alpha_2)\,\beta = \alpha_1\beta + \alpha_2\beta.$$


We leave the proof to the reader.

We remark finally that the product of several factors has a modulus
equal to the product of the moduli of the factors, and an amplitude equal
to the sum of the amplitudes of the factors. Also, a product is zero when
and only when at least one factor is zero.


173. Division of complex numbers. Division is defined for complex
numbers as the inverse operation to multiplication. It can easily
be seen to follow that, if a complex number of modulus and amplitude
$(r_1, \varphi_1)$ is divided by a number of modulus and amplitude $(r_2, \varphi_2)$,
division yields a unique result if the divisor differs from zero, the
modulus and amplitude of the quotient being $r_1/r_2$ and $(\varphi_1 - \varphi_2)$.
We can write:


$$\frac{r_1(\cos\varphi_1 + i\sin\varphi_1)}{r_2(\cos\varphi_2 + i\sin\varphi_2)} = \frac{r_1}{r_2}\,[\cos(\varphi_1 - \varphi_2) + i\sin(\varphi_1 - \varphi_2)].$$ (9)

Thus, the modulus of the quotient of two complex numbers is equal
to the quotient of their moduli, whilst the amplitude of the quotient is
equal to the difference of their amplitudes. If $r_2 = 0$, (9) becomes
meaningless.

If the numbers are given as $a_1 + b_1 i$ and $a_2 + b_2 i$ instead of in the
trigonometric form, their moduli and amplitudes can be expressed in
terms of $a_1$, $a_2$, $b_1$, $b_2$ and set in (9), and the following expression
obtained for the quotient:


$$\frac{a_1 + b_1 i}{a_2 + b_2 i} = \frac{a_1 a_2 + b_1 b_2}{a_2^2 + b_2^2} + \frac{b_1 a_2 - a_1 b_2}{a_2^2 + b_2^2}\, i.$$

This expression can also be obtained directly by considering i as an irrational
and multiplying numerator and denominator by the conjugate complex
of the denominator, thus getting rid of irrationals in the denominator:


$$\frac{a_1 + b_1 i}{a_2 + b_2 i} = \frac{(a_1 + b_1 i)(a_2 - b_2 i)}{(a_2 + b_2 i)(a_2 - b_2 i)} = \frac{(a_1 a_2 + b_1 b_2) + (b_1 a_2 - a_1 b_2)\, i}{a_2^2 + b_2^2},$$
and finally:


$$\frac{a_1 + b_1 i}{a_2 + b_2 i} = \frac{a_1 a_2 + b_1 b_2}{a_2^2 + b_2^2} + \frac{b_1 a_2 - a_1 b_2}{a_2^2 + b_2^2}\, i.$$ (10)
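
Formula (10) translates directly into a few lines of code. The following is a sketch under the assumption that the numbers are given by their real and imaginary parts; the name `divide_rule10` is made up for this illustration, and Python's built-in complex division is used only as an independent check.

```python
def divide_rule10(a1, b1, a2, b2):
    """Quotient (a1 + b1 i)/(a2 + b2 i) by formula (10), returned as (x, y)."""
    d = a2 * a2 + b2 * b2              # square of the modulus of the divisor
    if d == 0:
        raise ZeroDivisionError("the divisor must differ from zero")
    return (a1 * a2 + b1 * b2) / d, (b1 * a2 - a1 * b2) / d

x, y = divide_rule10(1, 2, 3, 4)          # (1 + 2i)/(3 + 4i)
check = complex(1, 2) / complex(3, 4)     # built-in division, for comparison
```

The denominator is the real number $a_2^2 + b_2^2$, so the only case that has to be excluded is a divisor equal to zero.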


We mentioned above [172] that the commutative, associative and 
distributive laws continue to be valid in the case of addition and mul- 
tiplication of complex numbers; and it follows that all the well-known 
transformations in regard to real numbers that follow from these 
laws, also hold for expressions containing complex numbers. We 
include here: the rules for enclosing in and removal of brackets, ele- 
mentary formulae, Newton's binomial formula with positive integral 
exponent, formulae relating to progressions, etc. 

We note a further important property of expressions containing
complex numbers, connected by the signs of the four primary operations.
The following proposition is obtained at once from (4), (5), (7)
and (10): if all the numbers in a sum, product, difference or quotient are
replaced by their conjugates, the result of the operations is also replaced
by its conjugate.

For example, on substituting $(-b_1)$ and $(-b_2)$ for $b_1$ and $b_2$ in (7),
we get:


$$(a_1 - b_1 i)(a_2 - b_2 i) = (a_1 a_2 - b_1 b_2) - (b_1 a_2 + a_1 b_2)\, i.$$
This property will evidently hold for any sort of expression that contains
complex numbers, connected by the signs of the four primary operations.


174. Raising to a power. We obtain the rule for raising a complex
number to a positive integral power by applying (6) in the case of n 
equal factors: 


$$[r(\cos\varphi + i\sin\varphi)]^n = r^n(\cos n\varphi + i\sin n\varphi),$$ (11)


i.e. a complex number is raised to a positive integral power by raising 
its modulus to the same power and multiplying its amplitude by the 
exponent of the power. 

We obtain de Moivre’s theorem on putting r = 1 in (11): 


$$(\cos\varphi + i\sin\varphi)^n = \cos n\varphi + i\sin n\varphi.$$ (12)


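Examples. 1. If we expand the left-hand side of (12) by Newton's
binomial formula and equate real and imaginary parts in accordance with
(2), we get expressions for $\cos n\varphi$ and $\sin n\varphi$ in powers of $\cos\varphi$ and $\sin\varphi$†:


$$\cos n\varphi = \cos^n\varphi - \binom{n}{2}\cos^{n-2}\varphi\,\sin^2\varphi + \binom{n}{4}\cos^{n-4}\varphi\,\sin^4\varphi + \ldots + (-1)^k\binom{n}{2k}\cos^{n-2k}\varphi\,\sin^{2k}\varphi + \ldots + \begin{cases} (-1)^{n/2}\sin^n\varphi & (n \text{ even}) \\ (-1)^{(n-1)/2}\, n\cos\varphi\,\sin^{n-1}\varphi & (n \text{ odd}) \end{cases}$$ (13)


$$\sin n\varphi = \binom{n}{1}\cos^{n-1}\varphi\,\sin\varphi - \binom{n}{3}\cos^{n-3}\varphi\,\sin^3\varphi + \binom{n}{5}\cos^{n-5}\varphi\,\sin^5\varphi - \ldots + (-1)^k\binom{n}{2k+1}\cos^{n-2k-1}\varphi\,\sin^{2k+1}\varphi + \ldots + \begin{cases} (-1)^{(n-2)/2}\, n\cos\varphi\,\sin^{n-1}\varphi & (n \text{ even}) \\ (-1)^{(n-1)/2}\sin^n\varphi & (n \text{ odd}). \end{cases}$$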


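In the particular case of n = 3, (12) becomes, after removing the brackets:
$$\cos^3\varphi + 3i\cos^2\varphi\,\sin\varphi - 3\cos\varphi\,\sin^2\varphi - i\sin^3\varphi = \cos 3\varphi + i\sin 3\varphi,$$
whence
$$\cos 3\varphi = \cos^3\varphi - 3\cos\varphi\,\sin^2\varphi; \quad \sin 3\varphi = 3\cos^2\varphi\,\sin\varphi - \sin^3\varphi.$$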
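
The rule for raising to a power and the n = 3 case can be verified numerically. This is merely a sketch: the function names and the sample values of r, the amplitude and n are chosen here for illustration and do not come from the text.

```python
import math

def de_moivre(r, phi, n):
    """r^n (cos n*phi + i sin n*phi), i.e. the right-hand side of (11)."""
    return complex(r ** n * math.cos(n * phi), r ** n * math.sin(n * phi))

def repeated_product(r, phi, n):
    """The same power obtained by multiplying n equal factors."""
    z = complex(r * math.cos(phi), r * math.sin(phi))
    result = complex(1, 0)
    for _ in range(n):
        result *= z
    return result

def triple_angle(phi):
    """(cos 3phi, sin 3phi) expressed through powers of cos phi and sin phi."""
    c, s = math.cos(phi), math.sin(phi)
    return c ** 3 - 3 * c * s ** 2, 3 * c ** 2 * s - s ** 3

lhs = repeated_product(1.5, 0.7, 5)
rhs = de_moivre(1.5, 0.7, 5)
cos3, sin3 = triple_angle(0.7)
```

For large n the formula needs a single power and two trigonometric evaluations, while the repeated product needs n complex multiplications.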


† The symbol $\binom{n}{m}$ denotes the number of combinations of n elements
in groups of m, i.e.

$$\binom{n}{m} = \frac{n(n-1)\ldots(n-m+1)}{1\cdot 2\ldots m} = \frac{n!}{m!\,(n-m)!}.$$

2. To sum the expressions:


$$A_n = 1 + r\cos\varphi + r^2\cos 2\varphi + \ldots + r^{n-1}\cos(n-1)\varphi,$$
$$B_n = r\sin\varphi + r^2\sin 2\varphi + \ldots + r^{n-1}\sin(n-1)\varphi.$$


We put:
$$z = r(\cos\varphi + i\sin\varphi)$$
and form the complex number:
$$A_n + B_n i = 1 + r(\cos\varphi + i\sin\varphi) + r^2(\cos 2\varphi + i\sin 2\varphi) + \ldots + r^{n-1}[\cos(n-1)\varphi + i\sin(n-1)\varphi].$$


We use equation (11) and the formula for the sum of a geometrical progression:
$$A_n + B_n i = 1 + z + z^2 + \ldots + z^{n-1} = \frac{1 - z^n}{1 - z} = \frac{1 - r^n(\cos\varphi + i\sin\varphi)^n}{1 - r(\cos\varphi + i\sin\varphi)} = \frac{(1 - r^n\cos n\varphi) - ir^n\sin n\varphi}{(1 - r\cos\varphi) - ir\sin\varphi}.$$


On multiplying the numerator and denominator of the last fraction by
$(1 - r\cos\varphi) + ir\sin\varphi$, the conjugate of the denominator, we get:


$$A_n + B_n i = \frac{[(1 - r^n\cos n\varphi) - ir^n\sin n\varphi]\,[(1 - r\cos\varphi) + ir\sin\varphi]}{(1 - r\cos\varphi)^2 + r^2\sin^2\varphi} =$$
$$= \frac{(1 - r^n\cos n\varphi)(1 - r\cos\varphi) + r^{n+1}\sin\varphi\,\sin n\varphi}{r^2 - 2r\cos\varphi + 1} + i\,\frac{(1 - r^n\cos n\varphi)\, r\sin\varphi - (1 - r\cos\varphi)\, r^n\sin n\varphi}{r^2 - 2r\cos\varphi + 1} =$$
$$= \frac{r^{n+1}\cos(n-1)\varphi - r^n\cos n\varphi - r\cos\varphi + 1}{r^2 - 2r\cos\varphi + 1} + i\,\frac{r^{n+1}\sin(n-1)\varphi - r^n\sin n\varphi + r\sin\varphi}{r^2 - 2r\cos\varphi + 1}.$$


Equating real and imaginary parts in accordance with (2) gives:


$$A_n = 1 + r\cos\varphi + r^2\cos 2\varphi + \ldots + r^{n-1}\cos(n-1)\varphi = \frac{r^{n+1}\cos(n-1)\varphi - r^n\cos n\varphi - r\cos\varphi + 1}{r^2 - 2r\cos\varphi + 1};$$
$$B_n = r\sin\varphi + r^2\sin 2\varphi + \ldots + r^{n-1}\sin(n-1)\varphi = \frac{r^{n+1}\sin(n-1)\varphi - r^n\sin n\varphi + r\sin\varphi}{r^2 - 2r\cos\varphi + 1}.$$

On taking the absolute value of the real number r as less than unity, and
letting n increase indefinitely, we obtain in the limit the sum of the infinite 
series: 











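$$1 + r\cos\varphi + r^2\cos 2\varphi + \ldots = \frac{1 - r\cos\varphi}{r^2 - 2r\cos\varphi + 1};$$
$$r\sin\varphi + r^2\sin 2\varphi + \ldots = \frac{r\sin\varphi}{r^2 - 2r\cos\varphi + 1}.$$ (14)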


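If we set r = 1 in $A_n$ and $B_n$, we get:


$$1 + \cos\varphi + \cos 2\varphi + \ldots + \cos(n-1)\varphi = \frac{\cos(n-1)\varphi - \cos n\varphi - \cos\varphi + 1}{2(1 - \cos\varphi)} =$$
$$= \frac{2\sin\frac{\varphi}{2}\,\sin\left(n - \frac{1}{2}\right)\varphi + 2\sin^2\frac{\varphi}{2}}{4\sin^2\frac{\varphi}{2}} = \frac{\sin\left(n - \frac{1}{2}\right)\varphi + \sin\frac{\varphi}{2}}{2\sin\frac{\varphi}{2}} = \frac{\sin\frac{n\varphi}{2}\,\cos\frac{(n-1)\varphi}{2}}{\sin\frac{\varphi}{2}}.$$ (15₁)
We find similarly:
$$\sin\varphi + \sin 2\varphi + \ldots + \sin(n-1)\varphi = \frac{\sin\frac{n\varphi}{2}\,\sin\frac{(n-1)\varphi}{2}}{\sin\frac{\varphi}{2}}.$$ (15₂)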
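
The closed forms for $A_n$ and $B_n$, and their special case r = 1, can be compared with term-by-term summation. The sketch below uses function names invented for the purpose and assumes that the denominators $r^2 - 2r\cos\varphi + 1$ and $\sin(\varphi/2)$ do not vanish for the chosen values.

```python
import math

def sums_direct(r, phi, n):
    """A_n and B_n obtained by adding the n terms one by one."""
    a_n = sum(r ** k * math.cos(k * phi) for k in range(n))
    b_n = sum(r ** k * math.sin(k * phi) for k in range(n))
    return a_n, b_n

def sums_closed(r, phi, n):
    """A_n and B_n from the closed expressions with denominator r^2 - 2r cos(phi) + 1."""
    d = r * r - 2 * r * math.cos(phi) + 1
    a_n = (r ** (n + 1) * math.cos((n - 1) * phi) - r ** n * math.cos(n * phi)
           - r * math.cos(phi) + 1) / d
    b_n = (r ** (n + 1) * math.sin((n - 1) * phi) - r ** n * math.sin(n * phi)
           + r * math.sin(phi)) / d
    return a_n, b_n

def sums_unit_modulus(phi, n):
    """The case r = 1, written with half-angles."""
    s = math.sin(phi / 2)
    half_n = math.sin(n * phi / 2)
    return (half_n * math.cos((n - 1) * phi / 2) / s,
            half_n * math.sin((n - 1) * phi / 2) / s)

direct = sums_direct(0.5, 1.1, 7)
closed = sums_closed(0.5, 1.1, 7)
unit_direct = sums_direct(1.0, 0.9, 6)
unit_closed = sums_unit_modulus(0.9, 6)
```

The direct sums cost n terms each, whereas the closed forms need a fixed number of operations whatever the value of n.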


175. Extraction of roots. The n-th root of a complex number is defined
as the complex number whose n-th power is the original number.
The equation

$$\sqrt[n]{r(\cos\varphi + i\sin\varphi)} = \rho(\cos\psi + i\sin\psi)$$
is thus equivalent to:
$$\rho^n(\cos n\psi + i\sin n\psi) = r(\cos\varphi + i\sin\varphi).$$
But the moduli of equal complex numbers must be equal, and their
amplitudes can only differ by a multiple of $2\pi$, i.e.


$$\rho^n = r, \quad n\psi = \varphi + 2k\pi,$$
whence

$$\rho = \sqrt[n]{r}, \quad \psi = \frac{\varphi + 2k\pi}{n},$$

where $\sqrt[n]{r}$ is the arithmetic value of the root, and k is any integer.

We thus have:

$$\sqrt[n]{r(\cos\varphi + i\sin\varphi)} = \sqrt[n]{r}\left(\cos\frac{\varphi + 2k\pi}{n} + i\sin\frac{\varphi + 2k\pi}{n}\right),$$ (16)





i.e. the root of a complex number is to be extracted by extracting the root
of its modulus and dividing its amplitude by the exponent of the root.

All possible integral values can be taken by k in (16);
it can be shown, however, that there are only n different values of
the root, corresponding to:


$$k = 0, 1, 2, \ldots, (n - 1).$$ (17)


We prove this by noting that the right-hand side of (16) has different
values for two different values of k, $k = k_1$ and $k = k_2$, when
the amplitudes $\dfrac{\varphi + 2k_1\pi}{n}$ and $\dfrac{\varphi + 2k_2\pi}{n}$ do not differ by a multiple
of $2\pi$, whilst they are the same if the amplitudes do differ by a
multiple of $2\pi$.

But the absolute value of the difference $(k_1 - k_2)$ of two distinct numbers
in the series (17) is less than n, and therefore the difference


$$\frac{\varphi + 2k_1\pi}{n} - \frac{\varphi + 2k_2\pi}{n} = \frac{k_1 - k_2}{n}\, 2\pi$$


cannot be a multiple of $2\pi$, i.e. the n different values of k of series
(17) correspond to n different values of the root. Now let $k_2$ be
an integer not included in series (17). We can divide by n and write
it in the form:


$$k_2 = qn + k_1,$$
where q is an integer and $k_1$ is one of the numbers of (17), and hence


$$\frac{\varphi + 2k_2\pi}{n} = \frac{\varphi + 2k_1\pi}{n} + 2\pi q,$$


i.e. the same value of the root corresponds to $k_2$ as to $k_1$, belonging
to (17). Thus, the n-th root of a complex number has n different values.

The only exception to this rule is the case of the number under 
the root being zero, i.e. r = 0, when all the above mentioned values 
of the root are zero.

Examples. 1. We find all the values of $\sqrt[3]{i}$. The modulus of i is unity and
its amplitude is $\pi/2$, and hence:


$$\sqrt[3]{i} = \sqrt[3]{\cos\frac{\pi}{2} + i\sin\frac{\pi}{2}} = \cos\frac{\frac{\pi}{2} + 2k\pi}{3} + i\sin\frac{\frac{\pi}{2} + 2k\pi}{3} \quad (k = 0, 1, 2).$$


We get the following three values for $\sqrt[3]{i}$:


$$\cos\frac{\pi}{6} + i\sin\frac{\pi}{6} = \frac{\sqrt{3}}{2} + \frac{1}{2}\, i; \quad \cos\frac{5\pi}{6} + i\sin\frac{5\pi}{6} = -\frac{\sqrt{3}}{2} + \frac{1}{2}\, i; \quad \cos\frac{3\pi}{2} + i\sin\frac{3\pi}{2} = -i.$$

2. We consider all the values of $\sqrt[n]{1}$, i.e. all the solutions of the equation


$$z^n = 1.$$


The modulus of unity is unity, and its amplitude is zero, so that


$$\sqrt[n]{1} = \sqrt[n]{\cos 0 + i\sin 0} = \cos\frac{2k\pi}{n} + i\sin\frac{2k\pi}{n} \quad (k = 0, 1, 2, \ldots, n - 1).$$


Let $\varepsilon$ be the value obtained for the root with k = 1:
$$\varepsilon = \cos\frac{2\pi}{n} + i\sin\frac{2\pi}{n}.$$


By de Moivre's theorem:


$$\varepsilon^k = \cos\frac{2k\pi}{n} + i\sin\frac{2k\pi}{n},$$


i.e. all the roots of the equation $z^n = 1$ have the form:
$$\varepsilon^k \quad (k = 0, 1, 2, \ldots, n - 1),$$


where we have to take $\varepsilon^0 = 1$.
We now consider an equation of the form


$$z^n = a.$$


We introduce a new variable u in place of z, putting


$$z = u\sqrt[n]{a},$$
where $\sqrt[n]{a}$ is one of the values of the nth root of a.
If we substitute this expression for z in the given equation, we get the
equation for u:
$$u^n = 1.$$


It is clear from this that all the roots of $z^n = a$ can be put in the form:


$$\sqrt[n]{a}\,\varepsilon^k \quad (k = 0, 1, 2, \ldots, n - 1),$$


where $\sqrt[n]{a}$ is one of the n values of this root and $\varepsilon^k$ takes all the values of the
nth root of unity.
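
The rule for extracting roots lends itself to a short computation. The following sketch relies on Python's standard `cmath.polar` and `cmath.rect`; the function name `nth_roots` and the convention of returning a single zero for a = 0 are choices made for this illustration only.

```python
import cmath
import math

def nth_roots(a, n):
    """Return the n values of the n-th root of the complex number a."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    if a == 0:
        return [0j]                      # all values of the root coincide
    r, phi = cmath.polar(a)              # modulus and one value of the amplitude
    root_r = r ** (1.0 / n)              # arithmetic root of the modulus
    return [cmath.rect(root_r, (phi + 2 * math.pi * k) / n) for k in range(n)]

cube_roots_of_i = nth_roots(1j, 3)       # k = 0, 1, 2
roots_of_unity = nth_roots(1, 5)         # the powers of epsilon for n = 5
```

Taking k beyond n − 1 would only repeat values already in the list, since the amplitude would change by a multiple of $2\pi$.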


176. Exponential functions. We considered above the exponential
function $e^x$ in the case of a real exponent x. We now generalize the
concept of exponential function to include a complex exponent.
With a real exponent, $e^x$ can be put in the form of a series [129]:


$$e^x = 1 + \frac{x}{1!} + \frac{x^2}{2!} + \frac{x^3}{3!} + \ldots$$


We define the exponential function with a pure imaginary exponent
by means of an analogous series, i.e. we write:


$$e^{yi} = 1 + \frac{yi}{1!} + \frac{(yi)^2}{2!} + \frac{(yi)^3}{3!} + \ldots$$

Separating the real and imaginary terms gives:
$$e^{yi} = \left(1 - \frac{y^2}{2!} + \frac{y^4}{4!} - \ldots\right) + i\left(\frac{y}{1!} - \frac{y^3}{3!} + \frac{y^5}{5!} - \ldots\right),$$


from which we deduce, on recalling the expansions of cos y and
sin y [130]:

$$e^{yi} = \cos y + i\sin y.$$ (18)
This formula also defines an exponential function with pure imaginary
exponent.


Substitution of $(-y)$ for y gives:
$$e^{-yi} = \cos y - i\sin y.$$ (19)


Solution of (18) and (19) with respect to cos y and sin y gives us Euler's
formulae, expressing the trigonometric functions in terms of exponential
functions with pure imaginary exponents:

$$\cos y = \frac{e^{yi} + e^{-yi}}{2}; \quad \sin y = \frac{e^{yi} - e^{-yi}}{2i}.$$ (20)
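
The definition of $e^{yi}$ by a series can be checked by summing a finite number of terms. This is a sketch only: the function names and the default of 40 terms are assumptions made here, adequate for moderate values of y but not a general-purpose implementation.

```python
import math

def exp_imag_series(y, terms=40):
    """Partial sum of 1 + yi/1! + (yi)^2/2! + ... with the given number of terms."""
    total = 0j
    term = 1 + 0j                        # current term (yi)^k / k!, starting at k = 0
    for k in range(terms):
        total += term
        term *= complex(0, y) / (k + 1)  # next term from the previous one
    return total

def cos_sin_from_exponentials(y):
    """cos y and sin y recovered from e^{yi} and e^{-yi} as in Euler's formulae."""
    e_plus, e_minus = exp_imag_series(y), exp_imag_series(-y)
    return (e_plus + e_minus) / 2, (e_plus - e_minus) / (2j)

value = exp_imag_series(1.3)
c, s = cos_sin_from_exponentials(1.3)
```

Each new term is obtained from the previous one by a single multiplication, which avoids computing large powers and factorials separately.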


We obtain from (18) a new exponential form of a complex number,
of modulus r and amplitude $\varphi$:


$$r(\cos\varphi + i\sin\varphi) = re^{\varphi i}.$$

The exponential function with a complex exponent x + yi is defined
by:


$$e^{x + yi} = e^x\cdot e^{yi} = e^x(\cos y + i\sin y),$$ (21)


i.e. the modulus of $e^{x + yi}$ is taken as $e^x$, and its amplitude as y.

The rule for adding exponents when multiplying is easily generalized
to include complex numbers.

Let $z = x + yi$ and $z_1 = x_1 + y_1 i$:


$$e^z\cdot e^{z_1} = e^x(\cos y + i\sin y)\cdot e^{x_1}(\cos y_1 + i\sin y_1),$$
or, using the rule for multiplying complex numbers [172]:


$$e^z\cdot e^{z_1} = e^{x + x_1}[\cos(y + y_1) + i\sin(y + y_1)].$$


But it follows from the definition (21) that the expression on the
right-hand side of this equation represents:


$$e^{(x + x_1) + (y + y_1) i}, \quad \text{i.e. } e^{z + z_1}.$$


The rule for subtracting exponents on division,


$$\frac{e^z}{e^{z_1}} = e^{z - z_1},$$
can be verified at once by multiplying the quotient by the denominator.
We have, for a positive integer n:


$$(e^z)^n = e^z\cdot e^z \cdots e^z = e^{nz}.$$

We can use Euler's formulae to express any positive integral power of
$\sin\varphi$ and $\cos\varphi$, or any product of such powers, as the sum of terms containing
only the first powers of the sines or cosines of multiples of the angle:


$$\sin^m\varphi = \frac{(e^{\varphi i} - e^{-\varphi i})^m}{2^m\, i^m}; \quad \cos^m\varphi = \frac{(e^{\varphi i} + e^{-\varphi i})^m}{2^m}.$$ (22)


We obtain the required expression on expanding the right-hand sides of
these equations by Newton's binomial formula, cross-multiplying, and replacing
the exponential by the trigonometric functions in the expansions obtained.

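Examples. 1.

$$\cos^4\varphi = \frac{(e^{\varphi i} + e^{-\varphi i})^4}{16} = \frac{e^{4\varphi i}}{16} + \frac{4e^{2\varphi i}}{16} + \frac{6}{16} + \frac{4e^{-2\varphi i}}{16} + \frac{e^{-4\varphi i}}{16} =$$
$$= \frac{3}{8} + \frac{1}{2}\cdot\frac{e^{2\varphi i} + e^{-2\varphi i}}{2} + \frac{1}{8}\cdot\frac{e^{4\varphi i} + e^{-4\varphi i}}{2} = \frac{3}{8} + \frac{1}{2}\cos 2\varphi + \frac{1}{8}\cos 4\varphi.$$

2.

$$\sin^4\varphi\,\cos^3\varphi = \frac{(e^{\varphi i} - e^{-\varphi i})^4}{16}\cdot\frac{(e^{\varphi i} + e^{-\varphi i})^3}{8} = \frac{(e^{2\varphi i} - e^{-2\varphi i})^3\,(e^{\varphi i} - e^{-\varphi i})}{128} =$$
$$= \frac{(e^{6\varphi i} - 3e^{2\varphi i} + 3e^{-2\varphi i} - e^{-6\varphi i})\,(e^{\varphi i} - e^{-\varphi i})}{128} =$$
$$= \frac{e^{7\varphi i} - e^{5\varphi i} - 3e^{3\varphi i} + 3e^{\varphi i} + 3e^{-\varphi i} - 3e^{-3\varphi i} - e^{-5\varphi i} + e^{-7\varphi i}}{128} =$$
$$= \frac{3}{64}\cos\varphi - \frac{3}{64}\cos 3\varphi - \frac{1}{64}\cos 5\varphi + \frac{1}{64}\cos 7\varphi.$$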
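
Expansions of this kind are easy to get wrong by a sign, so a numerical spot check is worthwhile. The sketch below compares the products of powers with the sums of cosines of multiple angles obtained by the method just described; the function names and the sample angles are invented for this illustration.

```python
import math

def cos4_reduced(phi):
    """cos^4(phi) written with cosines of multiples of the angle."""
    return 3 / 8 + math.cos(2 * phi) / 2 + math.cos(4 * phi) / 8

def sin4_cos3_reduced(phi):
    """sin^4(phi) cos^3(phi) written with cosines of odd multiples of the angle."""
    return (3 * math.cos(phi) - 3 * math.cos(3 * phi)
            - math.cos(5 * phi) + math.cos(7 * phi)) / 64

sample_angles = [0.0, 0.4, 1.1, 2.5, -0.9]
max_error_cos4 = max(abs(math.cos(p) ** 4 - cos4_reduced(p)) for p in sample_angles)
max_error_mixed = max(
    abs(math.sin(p) ** 4 * math.cos(p) ** 3 - sin4_cos3_reduced(p))
    for p in sample_angles
)
```

Both expressions are even functions of the angle, which is why only cosines appear in the reduced forms.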

We remark here that any integral power of $\cos\varphi$ and any even power
of $\sin\varphi$ represents an even function of $\varphi$, i.e. one whose value remains
unchanged when $\varphi$ is replaced by $(-\varphi)$, and whose expansion contains only
cosines of multiples of the angle. An odd function of $\varphi$, i.e. one which changes sign
when $\varphi$ is replaced by $(-\varphi)$, is obtained, for example, in the case of odd powers
of $\sin\varphi$; its expansion contains only sines of multiples of the angle, and the
absolute term is entirely missing. These points will be considered in more
detail in our treatment of trigonometric series.


177. Trigonometric and hyperbolic functions. We have so far considered
trigonometric functions only in the case of real arguments.
We define these functions for any complex argument z by means of
Euler's formulae:


$$\cos z = \frac{e^{zi} + e^{-zi}}{2}; \quad \sin z = \frac{e^{zi} - e^{-zi}}{2i},$$


where the expressions on the right have the meanings indicated in [176].

By using these formulae, together with the basic properties of ex- 
ponential functions, the trigonometric formulae are easily shown to 
hold for a complex argument. We propose the proof of the following 
relationships as an exercise to the reader: 


$$\sin^2 z + \cos^2 z = 1;$$
$$\sin(z + z_1) = \sin z\cos z_1 + \cos z\sin z_1;$$
$$\cos(z + z_1) = \cos z\cos z_1 - \sin z\sin z_1.$$


The functions tan z and cot z are defined by:


$$\tan z = \frac{\sin z}{\cos z} = \frac{1}{i}\cdot\frac{e^{zi} - e^{-zi}}{e^{zi} + e^{-zi}} = \frac{1}{i}\cdot\frac{e^{2zi} - 1}{e^{2zi} + 1};$$
$$\cot z = \frac{\cos z}{\sin z} = i\,\frac{e^{zi} + e^{-zi}}{e^{zi} - e^{-zi}} = i\,\frac{e^{2zi} + 1}{e^{2zi} - 1}.$$


We now introduce the hyperbolic functions.
Hyperbolic sin and cos are defined by:


$$\sinh z = \frac{\sin iz}{i} = \frac{e^z - e^{-z}}{2}; \quad \cosh z = \cos iz = \frac{e^z + e^{-z}}{2};$$
$$\tanh z = \frac{\sinh z}{\cosh z} = \frac{e^z - e^{-z}}{e^z + e^{-z}} = \frac{e^{2z} - 1}{e^{2z} + 1};$$
$$\coth z = \frac{\cosh z}{\sinh z} = \frac{e^z + e^{-z}}{e^z - e^{-z}} = \frac{e^{2z} + 1}{e^{2z} - 1}.$$


The following relationships are easily shown to hold by using these 
formulae: 


$$\cosh^2 z - \sinh^2 z = 1;$$
$$\sinh(z_1 + z_2) = \sinh z_1\cosh z_2 + \cosh z_1\sinh z_2;$$
$$\cosh(z_1 + z_2) = \cosh z_1\cosh z_2 + \sinh z_1\sinh z_2;$$ (23)
$$\sinh 2z = 2\sinh z\cosh z, \quad \cosh 2z = \cosh^2 z + \sinh^2 z;$$
$$\tanh 2z = \frac{2\tanh z}{1 + \tanh^2 z}, \quad \coth 2z = \frac{1 + \coth^2 z}{2\coth z}.$$


The above relationships are analogous to the relationships in the
ordinary trigonometry of angles, and give rise to hyperbolic trigonometry.
The formulae of this latter are obtained from the corresponding
formulae of ordinary trigonometry by replacing sin z by i sinh z,
and cos z by cosh z; this follows at once from the formulae defining
the hyperbolic functions.

By using the above principle, we easily obtain the following for- 
mulae for reducing the sums of hyperbolic functions to the logarithmic 
form: 


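$$\sinh z_1 + \sinh z_2 = 2\sinh\frac{z_1 + z_2}{2}\cosh\frac{z_1 - z_2}{2};$$
$$\sinh z_1 - \sinh z_2 = 2\sinh\frac{z_1 - z_2}{2}\cosh\frac{z_1 + z_2}{2};$$
$$\cosh z_1 + \cosh z_2 = 2\cosh\frac{z_1 + z_2}{2}\cosh\frac{z_1 - z_2}{2}; \qquad (24)$$
$$\cosh z_1 - \cosh z_2 = 2\sinh\frac{z_1 + z_2}{2}\sinh\frac{z_1 - z_2}{2}.$$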


We now consider the hyperbolic functions with real values of the 
argument: 








$$\sinh x = \frac{e^{x} - e^{-x}}{2}; \qquad \cosh x = \frac{e^{x} + e^{-x}}{2};$$

$$\tanh x = \frac{e^{2x} - 1}{e^{2x} + 1}; \qquad \coth x = \frac{e^{2x} + 1}{e^{2x} - 1}.$$

The graph of y = cosh x is a catenary [78], which is dealt with in
greater detail in [178].

The graphs of cosh x, sinh x, tanh x and coth x are illustrated in
Fig. 171.

Direct differentiation gives us the following expressions for the 
derivatives: 








$$\frac{d\sinh x}{dx} = \cosh x; \qquad \frac{d\cosh x}{dx} = \sinh x;$$

$$\frac{d\tanh x}{dx} = \frac{1}{\cosh^2 x}; \qquad \frac{d\coth x}{dx} = -\frac{1}{\sinh^2 x}.$$


[Fig. 171]


Hence we obtain the table of integrals: 
$$\int \sinh x\,dx = \cosh x + C; \qquad \int \cosh x\,dx = \sinh x + C;$$

$$\int \frac{dx}{\cosh^2 x} = \tanh x + C; \qquad \int \frac{dx}{\sinh^2 x} = -\coth x + C.$$

The name "hyperbolic functions" is due to the fact that cosh t
and sinh t play the same role in the parametric form of the rect-
angular hyperbola

$$x^2 - y^2 = a^2,$$

as cos t and sin t play as regards the circle

$$x^2 + y^2 = a^2.$$

The parametric form of the circle is

$$x = a\cos t; \qquad y = a\sin t,$$

whilst that of the rectangular hyperbola is

$$x = a\cosh t; \qquad y = a\sinh t,$$

as is easily seen on using the relationship:

$$\cosh^2 t - \sinh^2 t = 1.$$


The geometrical significance of the parameter t is similar in both
these cases. If S denotes the area of the sector AOM (Fig. 172), and
$S_0$ denotes the area of the circle ($S_0 = \pi a^2$), we evidently have

$$t = 2\pi\frac{S}{S_0}.$$

[Fig. 172] [Fig. 173]


Let S now denote the area of the corresponding sector of the 
rectangular hyperbola (Fig. 173). We have: 
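$$S = \text{area of } OMN - \text{area of } AMN = \frac{1}{2}xy - \int_a^x y\,dx = \frac{1}{2}x\sqrt{x^2 - a^2} - \int_a^x \sqrt{x^2 - a^2}\,dx.$$

On evaluating the integral in accordance with the formula of [92],
we find:

$$S = \frac{1}{2}x\sqrt{x^2 - a^2} - \frac{1}{2}\left[x\sqrt{x^2 - a^2} - a^2\log\left(x + \sqrt{x^2 - a^2}\right)\right]_a^x = \frac{1}{2}a^2\log\left(\frac{x}{a} + \sqrt{\frac{x^2}{a^2} - 1}\right).$$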
If we now put

$$t = 2\pi\frac{S}{S_0} = \log\left(\frac{x}{a} + \sqrt{\frac{x^2}{a^2} - 1}\right),$$

where $S_0$ again denotes the area of the circle, we easily find:

$$e^{t} = \frac{x}{a} + \sqrt{\frac{x^2}{a^2} - 1},$$

$$e^{-t} = \frac{1}{\dfrac{x}{a} + \sqrt{\dfrac{x^2}{a^2} - 1}} = \frac{x}{a} - \sqrt{\frac{x^2}{a^2} - 1},$$

and on adding and multiplying the result by a/2:

$$x = \frac{a}{2}\left(e^{t} + e^{-t}\right) = a\cosh t,$$

$$y = \sqrt{x^2 - a^2} = \sqrt{a^2\cosh^2 t - a^2} = a\sinh t,$$

i.e. we obtain the parametric form of the rectangular hyperbola.
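
The relation between the parameter t and the sector area can be checked by direct numerical integration. The sketch below is in Python and is only an illustration; the function name and the midpoint rule are our own choices, not part of the text. It computes S as the area of the triangle OMN less the area under the hyperbola from a to x, and should reproduce S = a²t/2, which is what t = 2πS/S₀ with S₀ = πa² amounts to.

```python
import math

def hyperbolic_sector_area(a: float, t: float, n: int = 20000) -> float:
    """Area of the sector bounded by OA, OM and the hyperbola x^2 - y^2 = a^2,
    where M = (a cosh t, a sinh t); the integral is taken by the midpoint rule."""
    x = a * math.cosh(t)
    y = a * math.sinh(t)
    dx = (x - a) / n
    under_curve = 0.0
    for i in range(n):
        xm = a + (i + 0.5) * dx              # midpoint of the i-th strip
        under_curve += math.sqrt(xm * xm - a * a)
    under_curve *= dx
    return 0.5 * x * y - under_curve         # triangle OMN minus area AMN

if __name__ == "__main__":
    a, t = 2.0, 1.0
    print(hyperbolic_sector_area(a, t), a * a * t / 2)
```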


178. The catenary. We investigate the curve in which a flexible cord of 
uniform density hangs when supported at its ends A, and A, (Fig. 174). 


[Fig. 174: the cord between $A_1(a_1, b_1)$ and $A_2(a_2, b_2)$, with the slope $\tan(\alpha + d\alpha)$ marked on the tangent]


We take the axis OX along the horizontal in the plane of the curve, and
OY vertically upwards. We divide the cord into elements $MM_1 = ds$. The
forces acting on each element are the tensions $T$ and $T_1$ from the remainder
of the cord and the weight of the element. The tensions act along the tan-
gents at the ends $M$ and $M_1$ of the element ($T$ is in the negative direction
of the tangent, $T_1$ in the positive direction). The weight of an element is
proportional to its length:

$$dp = \rho\,ds,$$

where $\rho$ is the linear density of the cord (the weight of unit length).

The necessary and sufficient conditions for equilibrium are that the 
sums of the horizontal and vertical components of the forces acting on the 
element are both zero. The horizontal component of the weight $dp$ of the
element is zero, so that the horizontal components of $T$ and $T_1$ must be
equal in magnitude and opposite in sign. Let $T_0$ denote the common magnitude
of these horizontal components.

The following expressions for the vertical components of the tensions are
obtained from the figure:

$$-T_0\tan\alpha = -T_0 y' \quad\text{and}\quad T_0\tan(\alpha + d\alpha) = T_0(y' + dy').$$

Here, $d\alpha$ denotes the increment of the angle $\alpha$, formed by the tangent
with OX, when we pass from the point $M$ to $M_1$, and $dy'$ is the correspond-
ing increment in the slope of the tangent, i.e. in $\tan\alpha$.

We find, on equating to zero the components of $T$, $T_1$ and the weight
$\rho\,ds$ along OY:

$$T_0(y' + dy') - T_0 y' - \rho\,ds = 0,$$

i.e.

$$T_0\,dy' = \rho\,ds,$$

which can be written as:

$$T_0\,dy' = \rho\sqrt{1 + y'^2}\,dx. \qquad (25)$$
We now separate the variables [93]:

$$\frac{dy'}{\sqrt{1 + y'^2}} = \frac{dx}{k}, \quad\text{where } k = \frac{T_0}{\rho};$$

it is to be noted that k is a constant, directly proportional to the horizontal 
component of the tension, and inversely proportional to the linear density 
of the cord. 
We integrate the equation obtained: 


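$$\log\left(y' + \sqrt{1 + y'^2}\right) = \frac{x + C_1}{k},$$

whence

$$e^{\frac{x + C_1}{k}} = y' + \sqrt{1 + y'^2};$$

then we find $y'$ by introducing the reciprocal:

$$e^{-\frac{x + C_1}{k}} = \frac{1}{y' + \sqrt{1 + y'^2}} = \sqrt{1 + y'^2} - y'.$$

We subtract this equation from the previous one, which gives

$$y' = \frac{1}{2}\left(e^{\frac{x + C_1}{k}} - e^{-\frac{x + C_1}{k}}\right).$$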
On integrating a second time, we get the required equation for the curve
of the cord:

$$y + C_2 = \frac{k}{2}\left(e^{\frac{x + C_1}{k}} + e^{-\frac{x + C_1}{k}}\right). \qquad (26)$$

The arbitrary constants $C_1$ and $C_2$ are found from the fact that the curve
passes through the points $A_1(a_1, b_1)$ and $A_2(a_2, b_2)$. In practice, however, the
main interest is not so much in the actual equation of the curve, i.e. in the
constants $C_1$, $C_2$, as in the relationship between the horizontal and vertical
distances between the points of suspension and the length of arc $A_1A_2$.

When studying the relationship between these three magnitudes, we can
of course make a parallel displacement of the coordinate axes. We can put
the origin at the point $(-C_1, -C_2)$, in which case, from (26), $C_1 = C_2 = 0$,
and (26) can be written more simply:


$$y = \frac{k}{2}\left(e^{\frac{x}{k}} + e^{-\frac{x}{k}}\right) = k\cosh\frac{x}{k}, \qquad (26_1)$$

from which it is clear that the curve of the suspension is a catenary.

Let the coordinates of $A_1$ and $A_2$, with the above choice of axes, be
$(a_1, b_1)$ and $(a_2, b_2)$. Let l, h, s denote respectively the horizontal and vertical
distances between the points of suspension, and the length of the cord;
then

$$l = a_2 - a_1, \qquad h = b_2 - b_1 = k\left(\cosh\frac{a_2}{k} - \cosh\frac{a_1}{k}\right);$$

$$s = \int_{a_1}^{a_2}\sqrt{1 + y'^2}\,dx = \int_{a_1}^{a_2}\sqrt{1 + \sinh^2\frac{x}{k}}\,dx = \int_{a_1}^{a_2}\cosh\frac{x}{k}\,dx = k\left(\sinh\frac{a_2}{k} - \sinh\frac{a_1}{k}\right).$$


We find on using (24): 


$$h = 2k\sinh\frac{a_2 + a_1}{2k}\sinh\frac{a_2 - a_1}{2k} = 2k\sinh\frac{l}{2k}\sinh\frac{a_2 + a_1}{2k};$$

$$s = 2k\sinh\frac{a_2 - a_1}{2k}\cosh\frac{a_2 + a_1}{2k} = 2k\sinh\frac{l}{2k}\cosh\frac{a_2 + a_1}{2k},$$

which gives us, on using the first of relationships (23):

$$s^2 - h^2 = 4k^2\sinh^2\frac{l}{2k},$$

representing the required relationship between l, h and s.
This can also be written in the following form: 


$$\frac{\sinh\dfrac{l}{2k}}{\dfrac{l}{2k}} = \frac{\sqrt{s^2 - h^2}}{l}. \qquad (27)$$

If the points of suspension and the length of the cord are given, the
magnitudes l, h and s are known, and we obtain an equation for the para-
meter k; if the linear density $\rho$ of the cord is also known, (27) can be used to
find the horizontal component of the tension $T_0$.
For the sake of brevity, we put:

$$\frac{l}{2k} = \xi, \qquad \frac{\sqrt{s^2 - h^2}}{l} = c.$$

Equation (27) becomes:

$$\frac{\sinh\xi}{\xi} = c. \qquad (27_1)$$


The exponential function can be expanded into a power series [129], so that 
we can write 


$$\frac{\sinh\xi}{\xi} = \frac{e^{\xi} - e^{-\xi}}{2\xi} = 1 + \frac{\xi^2}{3!} + \frac{\xi^4}{5!} + \frac{\xi^6}{7!} + \dots,$$

from which it is clear that, as $\xi$ increases from 0 to $+\infty$, the ratio also in-
creases steadily, from 1 to $+\infty$. Accordingly, for every given c > 1, $(27_1)$
has one positive root, which can be obtained from tables of hyperbolic func-
tions.†
The given magnitudes l, h, s must here satisfy the condition:

$$\frac{\sqrt{s^2 - h^2}}{l} > 1, \quad\text{or}\quad s^2 > h^2 + l^2,$$

which is also obvious geometrically, since $\sqrt{h^2 + l^2}$ is the length of the chord
$A_1A_2$, whilst s is the length of the catenary between these points.
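For example, let:

$$s = 100\ \text{m}, \quad l = 50\ \text{m}, \quad h = 20\ \text{m}, \quad \rho = 20\ \text{kg/m}.$$

We get:

$$c = 0.02\sqrt{10{,}000 - 400} = 0.8\sqrt{6} = 1.96$$

and we find the root of $(27_1)$ from the tables:

$$\xi = \frac{l}{2k} = 2.15,$$

whence

$$T_0 = k\rho = \frac{50}{2 \times 2.15}\cdot 20 = 232\ \text{kg}.$$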
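
Where tables are not to hand, the root of the equation sinh ξ / ξ = c can be found by bisection, since the left-hand side increases steadily with ξ. The following Python sketch is an illustration only; the function names and the choice of bisection are assumptions of ours, not the method of the text, which uses tables.

```python
import math

def solve_xi(c: float, tol: float = 1e-12) -> float:
    """Positive root of sinh(xi)/xi = c for c > 1, by bisection."""
    if c <= 1.0:
        raise ValueError("a positive root exists only for c > 1")

    def f(xi: float) -> float:
        return math.sinh(xi) / xi - c

    lo, hi = 1e-9, 1.0
    while f(hi) < 0.0:          # the ratio grows without bound, so this stops
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def horizontal_tension(s: float, l: float, h: float, rho: float):
    """Return (xi, k, T0) for cord length s, span l, height difference h, density rho."""
    c = math.sqrt(s * s - h * h) / l
    xi = solve_xi(c)
    k = l / (2.0 * xi)
    return xi, k, rho * k

if __name__ == "__main__":
    xi, k, T0 = horizontal_tension(100.0, 50.0, 20.0, 20.0)
    print(f"xi = {xi:.4f}, k = {k:.3f} m, T0 = {T0:.1f} kg")
```

With the data of the example, the bisection gives a root of roughly 2.14 and a tension of about 233 kg, in reasonable agreement with the tabular values 2.15 and 232 kg.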


Let the points of suspension be at the same height. We investigate the sag
f of the cord (Fig. 175):

$$f = k\cosh\frac{l}{2k} - k = \frac{k}{2}\left(e^{\frac{l}{2k}} + e^{-\frac{l}{2k}}\right) - k.$$

† The tables of Jahnke and Emde, for instance.
Expansion of the exponential functions gives us:

$$f = \frac{1}{8}\frac{l^2}{k} + \frac{1}{384}\frac{l^4}{k^3} + \dots \qquad (28)$$

We have in exactly the same way for s = arc $A_1A_2$ [with h = 0 in
(27)]:

$$s = 2k\sinh\frac{l}{2k} = k\left(e^{\frac{l}{2k}} - e^{-\frac{l}{2k}}\right) = l + \frac{1}{24}\frac{l^3}{k^2} + \frac{1}{1920}\frac{l^5}{k^4} + \dots \qquad (29)$$


We can find k approximately by confining ourselves to the first term of
series (28):

$$k \approx \frac{l^2}{8f}.$$

We preserve the first two terms of expansion (29) and substitute the
expression found for k:

$$s \approx l + \frac{8}{3}\frac{f^2}{l}.$$


Differentiation of this expression gives us the relationship between the
increase in length of the cord and the increase in sag:

$$ds = \frac{16}{3}\frac{f}{l}\,df, \quad\text{or}\quad df = \frac{3}{16}\frac{l}{f}\,ds.$$
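
The quality of the approximation for s in terms of the span and the sag can be judged by comparing it with the exact expressions f = k(cosh(l/2k) - 1) and s = 2k sinh(l/2k) for supports at equal height. The Python sketch below is illustrative; the function names and the sample values of k and l are assumptions, not data from the text.

```python
import math

def exact_sag_and_length(k: float, l: float):
    """Exact sag f and cord length s for supports at equal height, span l."""
    u = l / (2.0 * k)
    f = k * (math.cosh(u) - 1.0)
    s = 2.0 * k * math.sinh(u)
    return f, s

def approximate_length(l: float, f: float) -> float:
    """Two-term approximation s ~ l + (8/3) f^2 / l."""
    return l + 8.0 * f * f / (3.0 * l)

if __name__ == "__main__":
    k, l = 100.0, 50.0          # hypothetical values, in metres
    f, s = exact_sag_and_length(k, l)
    print(f"f = {f:.4f}, exact s = {s:.4f}, approx s = {approximate_length(l, f):.4f}")
```

For a shallow curve (small f/l) the two lengths differ only slightly; the discrepancy grows as the sag increases, as the neglected terms of (28) and (29) suggest.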


We obtained equation (25) on the assumption that the gravity force act- 
ing on each element of the cord was proportional to the length of the ele- 
ment. In certain cases, e.g. when considering the chains of suspension bridges, 
the gravity force must be reckoned proportional to the length of the projec- 
tion of the element on the horizontal axis. This happens when the loading 
due to the roadway of the bridge is large compared with the weight of the 
chains themselves, so that the latter can be neglected. We have now, instead 
of equation (25): 


$$T_0\,dy' = \rho\,dx,$$

whence

$$y' = \frac{\rho}{T_0}x + C_1,$$

and

$$y = \frac{\rho}{2T_0}x^2 + C_1x + C_2,$$


i.e. the curve of the suspension is a parabola. 

If we suppose that the ends $A_1$ and $A_2$ of the chain (or cord) are at the
same level, whilst we locate the origin of coordinates at the vertex of the
parabola (Fig. 176), its equation will be:

$$y = \alpha x^2 \quad \left(\alpha = \frac{\rho}{2T_0}\right).$$
We find, as above, the length of the span $l = A_1A_2$ and the sag $f = OA$.
We get from the equation of the parabola:

$$f = \frac{\alpha l^2}{4},$$

whence

$$\alpha = \frac{4f}{l^2}.$$

We calculate the length of arc $A_1A_2$,
which is twice the length of arc $OA_2$:

$$s = 2\int_0^{l/2}\sqrt{1 + 4\alpha^2x^2}\,dx.$$

[Fig. 176]


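We have by Newton's binomial formula:

$$\sqrt{1 + 4\alpha^2x^2} = \left(1 + 4\alpha^2x^2\right)^{1/2} = 1 + 2\alpha^2x^2 - 2\alpha^4x^4 + \dots$$

and integration gives us the expansion for s:

$$s = l + \frac{1}{6}\alpha^2l^3 - \frac{1}{40}\alpha^4l^5 + \dots$$

We substitute the expression found above for $\alpha$:

$$s = l\left[1 + \frac{8}{3}\left(\frac{f}{l}\right)^2 - \frac{32}{5}\left(\frac{f}{l}\right)^4 + \dots\right] = l\left(1 + \frac{8}{3}\varepsilon^2 - \frac{32}{5}\varepsilon^4 + \dots\right),$$

where $\varepsilon = f/l$. If we confine ourselves to the first two terms of this series,
we obtain the approximate formula:

$$s \approx l\left(1 + \frac{8}{3}\varepsilon^2\right),$$

which is the same as the corresponding formula for the catenary.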


179. Logarithms. The natural logarithm of the complex number 
$r(\cos\varphi + i\sin\varphi)$ is defined as the power to which e must be raised to
give the number. If we use Log to denote natural logarithms, we can
say that the equation

$$\operatorname{Log}[r(\cos\varphi + i\sin\varphi)] = x + yi$$

is equivalent to:

$$e^{x + yi} = r(\cos\varphi + i\sin\varphi).$$

The latter equation can be written:

$$e^{x}(\cos y + i\sin y) = r(\cos\varphi + i\sin\varphi),$$

whence we find, on equating moduli and amplitudes:

$$e^{x} = r, \qquad y = \varphi + 2k\pi \quad (k = 0, \pm 1, \pm 2, \dots),$$

i.e.

$$x = \log r \quad\text{and}\quad x + yi = \log r + (\varphi + 2k\pi)i$$

and finally:

$$\operatorname{Log}[r(\cos\varphi + i\sin\varphi)] = \log r + (\varphi + 2k\pi)i, \qquad (30)$$


i.e. the natural logarithm of a complex number is the complex number
whose real part is the ordinary logarithm of the modulus and whose 
imaginary part is the product of i and one of the values of the amplitude. 

It follows that the natural logarithm of any number has an infinite 
set of values. The only exception is zero, the logarithm of which does 
not exist. If we make the amplitude obey the inequality 


$$-\pi < \varphi \leq \pi,$$

we obtain the so-called principal values of the logarithm. The principal
value of the logarithm is denoted by log instead of Log, to distinguish
it from the general value, as given by (30); thus

$$\log[r(\cos\varphi + i\sin\varphi)] = \log r + \varphi i, \qquad (31)$$

where

$$-\pi < \varphi \leq \pi.$$


We define the complex power of a complex number with the aid of 
logarithms. If u and v are two complex numbers, where $u \neq 0$, we put:

$$u^{v} = e^{v\operatorname{Log}u}.$$

We note that Log u, and therefore $u^{v}$, has in general an infinite set
of values. 


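Examples. 1. The modulus of i is unity and its amplitude is $\frac{1}{2}\pi$, so that

$$\operatorname{Log} i = \left(\frac{1}{2}\pi + 2k\pi\right)i \quad (k = 0, \pm 1, \pm 2, \dots).$$

2. We find $i^{i}$:

$$i^{i} = e^{i\operatorname{Log}i} = e^{-\left(\frac{1}{2}\pi + 2k\pi\right)} \quad (k = 0, \pm 1, \pm 2, \dots).$$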
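
The many-valued logarithm and power are easily tabulated for a few values of k. The sketch below is in Python, using the standard `cmath` module; the function names `log_values` and `power_values` are illustrative assumptions. The amplitude returned by `cmath.polar` is taken as the principal one.

```python
import cmath
import math

def log_values(z: complex, ks=range(-1, 2)):
    """Values log r + (phi + 2k*pi) i of Log z for the given integers k."""
    if z == 0:
        raise ValueError("the logarithm of zero does not exist")
    r, phi = cmath.polar(z)     # modulus and principal amplitude of z
    return [complex(math.log(r), phi + 2.0 * math.pi * k) for k in ks]

def power_values(u: complex, v: complex, ks=range(-1, 2)):
    """Values of u**v = exp(v * Log u) corresponding to the given k."""
    return [cmath.exp(v * w) for w in log_values(u, ks)]

if __name__ == "__main__":
    print(log_values(1j))        # (pi/2 + 2k*pi) i for k = -1, 0, 1
    print(power_values(1j, 1j))  # all real: exp(-(pi/2 + 2k*pi))
```

All the values of i^i come out real, the value for k = 0 being exp(-π/2), roughly 0.208.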


180. Sinusoidal quantities and vector diagrams. The application of complex
quantities to the study of harmonic oscillations may be mentioned. We
consider a variable current j, whose value at each instant is the same through-
out the circuit, and is given by:

$$j = j_m\sin(\omega t + \varphi), \qquad (32)$$

where t represents time, and $j_m$, $\omega$ and $\varphi$ are constants.
The constant $j_m$, reckoned positive, is called the amplitude, $\omega$ is the
frequency, connected with the period T by the formula

$$T = \frac{2\pi}{\omega},$$

and $\varphi$ is the phase of the alternating current.
A current whose instantaneous value varies in accordance with (32) is
said to be sinusoidal. What has been said also applies to a voltage:

$$v = v_m\sin(\omega t + \varphi_1), \qquad (33)$$

and our discussion below is concerned with sinusoidal currents and voltages,
as given by (32) and (33).





[Fig. 177]


There is a simple method of representing geometrically sinusoidal quanti- 
ties of the same frequency. We take a radius vector through a fixed point 
O of the plane, and let it rotate clockwise with angular velocity $\omega$; this radius
vector is referred to as the time axis.

Let the time axis coincide with axis OX initially, i.e. at t = 0.

We draw a vector OA (Fig. 177) of length $j_m$, forming an angle $\varphi$ with
the initial position of the time axis (recalling that angles are reckoned positive
in the counter-clockwise direction). At time t, OA will form an angle $(\varphi + \omega t)$
with the time axis, which has rotated through an angle $\omega t$. We clearly get
the quantity $j = j_m\sin(\omega t + \varphi)$ from the projection of OA on the perpendic-
ular to the time axis, obtained by turning this latter counter-clockwise through
an angle $\pi/2$; or more briefly, j is obtained by dropping a perpendicular from
the end of OA on to the time axis, and taking its length with the appropriate 
sign. 

Another sinusoidal quantity of the same period,

$$j^{(1)} = j_m^{(1)}\sin(\omega t + \varphi_1),$$

must be represented by drawing a vector of length $j_m^{(1)}$ at an angle to the
first equal to

$$\psi = \varphi_1 - \varphi.$$
Sinusoidal quantities of the same frequency can thus be represented by
fixed vectors in a plane. The length of each vector gives the corresponding
amplitude, whilst the angle between any two represents their phase dif-
ference. This type of construction is called a vector diagram of the system of
sinusoidal quantities of the same period.

The sum of several such sinusoidal quantities is also sinusoidal and of 
the same period, being represented by the sum of the corresponding vectors 
on the diagram. 

The definition of multiplication given in [172] can be used to put operations 
with vector diagrams into a convenient analytic form. 

We shall in future denote vectors by letters in heavy type. 

We shall reckon the product of a vector $\mathbf{j}$ and a complex number $re^{\varphi i}$ as the
vector obtained by multiplying the length of $\mathbf{j}$ by r and turning its direction through
an angle $\varphi$, i.e. we multiply the complex number representing $\mathbf{j}$ by the com-
plex number $re^{\varphi i}$ in accordance with the rule given in [172].

If $re^{\varphi i}$ is given in the form (a + bi), we can write the product as the sum
of two vectors:

$$(a + bi)\,\mathbf{j} = a\mathbf{j} + bi\,\mathbf{j},$$

the first term being a vector parallel to $\mathbf{j}$, and the second a vector perpendi-
cular to $\mathbf{j}$.

We can split any vector $\mathbf{j}_1$ into two mutually perpendicular components,
and write it in the form:

$$\mathbf{j}_1 = a\mathbf{j} + bi\,\mathbf{j} = (a + bi)\,\mathbf{j}.$$

Here, |a + bi| evidently gives the ratio of the length of $\mathbf{j}_1$ to that of $\mathbf{j}$, whilst the amplitude
of (a + bi) represents the angle between the two vectors. This angle gives
the phase difference of the quantities corresponding to the vectors.

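We bring in the concept of the mean square value, denoted by $M(j^2)$,
of the sinusoidal quantity (32). This is defined by the equation:

$$M(j^2) = \frac{1}{T}\int_0^T j^2\,dt.$$

We have:

$$j^2 = j_m^2\sin^2(\omega t + \varphi) = \frac{1}{2}j_m^2 - \frac{1}{2}j_m^2\cos 2(\omega t + \varphi)$$

and integration from 0 to $T = 2\pi/\omega$ gives:

$$M(j^2) = \frac{1}{2}j_m^2 - \frac{1}{T}\left[\frac{1}{4\omega}j_m^2\sin 2(\omega t + \varphi)\right]_0^{2\pi/\omega} = \frac{1}{2}j_m^2.$$

The square root of the mean square value is called the effective, or root
mean square, value of the quantity:

$$j_{\text{eff}} = \sqrt{M(j^2)} = \frac{j_m}{\sqrt{2}}.$$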
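
The value of the mean square can be confirmed by numerical integration over one period. The following Python sketch is an illustration under our own choice of names and of the midpoint rule; the sample amplitude, frequency and phase are arbitrary.

```python
import math

def mean_square(j_m: float, omega: float, phi: float, n: int = 10000) -> float:
    """Mean of j(t)^2 over one period T = 2*pi/omega, by the midpoint rule."""
    T = 2.0 * math.pi / omega
    dt = T / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        total += (j_m * math.sin(omega * t + phi)) ** 2
    return total * dt / T

def effective_value(j_m: float, omega: float, phi: float) -> float:
    """Root mean square value of the sinusoidal quantity."""
    return math.sqrt(mean_square(j_m, omega, phi))

if __name__ == "__main__":
    print(mean_square(3.0, 50.0, 0.4))       # close to 3**2 / 2 = 4.5
    print(effective_value(3.0, 50.0, 0.4))   # close to 3 / sqrt(2)
```

The result does not depend on the frequency or the phase, only on the amplitude, as the formula requires.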


In practice, each vector in a diagram is usually made equal in length to
the effective value of the quantity, and not to its amplitude, i.e. the lengths
of the vectors are diminished in the ratio $1 : \sqrt{2}$ by comparison with the con-
struction given above.
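We obtain by differentiating (32):

$$\frac{dj}{dt} = \omega j_m\cos(\omega t + \varphi) = \omega j_m\sin\left(\omega t + \varphi + \frac{\pi}{2}\right),$$

i.e. the derivative dj/dt only differs from j in that its amplitude is multiplied
by $\omega$, and its phase increased by $\pi/2$.
The relationship deduced becomes in vector notation:

$$\frac{d\mathbf{j}}{dt} = \omega i\,\mathbf{j}. \qquad (34)$$

Integration of (32) gives us:

$$\int j\,dt = -\frac{1}{\omega}j_m\cos(\omega t + \varphi) = \frac{1}{\omega}j_m\sin\left(\omega t + \varphi - \frac{\pi}{2}\right),$$

where the arbitrary constant has to be neglected if we want to obtain a
quantity that is also sinusoidal; and it follows from this that:†

$$\int \mathbf{j}\,dt = \frac{1}{\omega i}\,\mathbf{j}. \qquad (35)$$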


181. Examples. 1. We consider an alternating current circuit consisting 
of a resistance R in series with an inductance L and a capacity C. If v de-
notes voltage and j denotes current, we know from physics that

$$v = Rj + L\frac{dj}{dt} + \frac{1}{C}\int j\,dt.$$


We confine ourselves for the present to the steady state conditions, and 
to the case when the voltage and current are sinusoidal quantities of the 
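same period. The above equation can be written in vector form, with v and
j replaced by the vectors of voltage and current, $\mathbf{v}$ and $\mathbf{j}$:

$$\mathbf{v} = R\mathbf{j} + L\frac{d\mathbf{j}}{dt} + \frac{1}{C}\int\mathbf{j}\,dt.$$

We find from this on recalling (34) and (35):

$$\mathbf{v} = R\mathbf{j} + \omega Li\,\mathbf{j} + \frac{1}{\omega Ci}\,\mathbf{j} = (R + ui)\,\mathbf{j} = \zeta\mathbf{j}, \qquad (36)$$

where

$$u = \omega L - \frac{1}{\omega C}, \quad\text{and}\quad \zeta = R + ui. \qquad (37)$$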


The relationship obtained between the voltage and current vectors has 
the ordinary form of Ohm’s law, except for the fact that the ohmic resistance 
is here replaced by a complex factor $\zeta$, called the impedance of the
circuit, and made up of three "impedances": an ohmic resistance R, an in-
ductive impedance ($\omega Li$) and a capacitative impedance ($1/\omega Ci$).

† The symbol $d\mathbf{j}/dt$ denotes the vector corresponding to the sinusoidal
quantity $dj/dt$, whilst $\int\mathbf{j}\,dt$ denotes the vector corresponding to $\int j\,dt$.

We also get from (36) the resolution of vector $\mathbf{v}$ into two components:
$R\mathbf{j}$ in the direction of $\mathbf{j}$, and $ui\,\mathbf{j}$ perpendicular to $\mathbf{j}$. The first is called the
real, and the second is called the wattless, component of the voltage. These
terms become clear if we calculate the mean power W in our circuit, which
we define as the arithmetic mean over a full period of the instantaneous
power vj:

$$W = \frac{1}{T}\int_0^T vj\,dt = \frac{v_mj_m}{T}\int_0^T\sin(\omega t + \varphi_1)\sin(\omega t + \varphi_2)\,dt;$$

here, $\varphi_1$ denotes the phase of the voltage, and $\varphi_2$ the phase of the current,
so that

$$v = v_m\sin(\omega t + \varphi_1); \qquad j = j_m\sin(\omega t + \varphi_2).$$


We easily find: 





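$$W = \frac{v_mj_m}{2T}\int_0^T\left[\cos(\varphi_1 - \varphi_2) - \cos(2\omega t + \varphi_1 + \varphi_2)\right]dt = \frac{v_mj_m}{2}\cos(\varphi_1 - \varphi_2) = v_{\text{eff}}\,j_{\text{eff}}\cos(\varphi_1 - \varphi_2). \qquad (38)$$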


The greatest absolute value of mean power is thus obtained when the phases
of the voltage and current either coincide or differ by $\pi$, whilst the least,
zero, power is found when the phases differ by $\pi/2$.

The wattless component $ui\,\mathbf{j}$ of the vector $\mathbf{v}$ gives a mean power of zero
when put into this expression for W, since $ui\,\mathbf{j}$ is perpendicular to $\mathbf{j}$, i.e.
$\cos(\varphi_1 - \varphi_2) = 0$ in this case; thus, all the mean power that passes into
Joule heat is provided by the real component alone.

We can write (36) as:

$$\mathbf{j} = \frac{1}{R + ui}\,\mathbf{v} = \eta\mathbf{v}, \quad\text{where}\quad \eta = \frac{1}{R + ui} = g + hi,$$

or

$$\mathbf{j} = g\mathbf{v} + hi\,\mathbf{v}.$$

The complex factor $\eta$ is called the admittance of the circuit, and is
equal to the reciprocal of the impedance. This last formula expresses the
current vector as the sum of a real component in the direction of $\mathbf{v}$, and
a wattless component perpendicular to $\mathbf{v}$.
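
The rules above translate directly into arithmetic with complex numbers. The Python sketch below is illustrative only: the function names and the circuit values are assumptions of ours, and the vectors are taken with lengths equal to the effective values, so that the mean power is the real part of the product of the voltage vector and the conjugate of the current vector.

```python
import math

def impedance(R: float, L: float, C: float, omega: float) -> complex:
    """Impedance R + u i of R, L, C in series, with u = omega L - 1/(omega C)."""
    u = omega * L - 1.0 / (omega * C)
    return complex(R, u)

def mean_power(v: complex, j: complex) -> float:
    """v_eff * j_eff * cos(phase difference), for effective-value vectors v and j."""
    return (v * j.conjugate()).real

if __name__ == "__main__":
    # Hypothetical circuit: 10 ohm, 0.05 H, 100 microfarad, at 50 cycles per second
    omega = 2.0 * math.pi * 50.0
    zeta = impedance(10.0, 0.05, 1e-4, omega)
    eta = 1.0 / zeta                    # admittance g + h i
    j = complex(2.0, 0.0)               # current vector, effective value 2 A
    v = zeta * j                        # Ohm's law in complex form
    print(zeta, eta, mean_power(v, j))  # the power equals R * |j|^2
```

The wattless component contributes nothing to the result: the mean power comes out as R times the square of the effective current, whatever the value of u.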

2. The basic rules, derived from Ohm's and Kirchhoff's laws, for finding the 
resistance of a direct current circuit that includes several resistances in series 
or parallel, can also be used for steady sinusoidal alternating current circuits, 
provided we agree to replace the instantaneous values of voltage and current 
by the corresponding vectors, and the ohmic resistances by the impedances. 

Thus, if the circuit includes the impedances in series:

$$\zeta_1 = R_1 + u_1i; \quad \zeta_2 = R_2 + u_2i; \quad \dots,$$

the voltage and current vectors will be connected by:

$$\mathbf{v} = \zeta\mathbf{j}, \quad\text{where}\quad \zeta = \zeta_1 + \zeta_2 + \dots, \qquad (39)$$

i.e. impedances are added in a series circuit.

If the same impedances are in parallel, we have on the other hand: 

$$\mathbf{v} = \zeta'\mathbf{j}, \quad\text{where}\quad \frac{1}{\zeta'} = \frac{1}{\zeta_1} + \frac{1}{\zeta_2} + \dots, \qquad (40)$$

i.e. the admittances are added in a parallel circuit. 

The total impedance in the case of series connection of impedances
$\zeta_1, \zeta_2, \dots$ is found geometrically by the simple process of constructing
the geometrical sum of the vectors representing these complex numbers.





[Fig. 178] [Fig. 179]


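We indicate the construction in the case of two apparent resistances $\zeta_1$
and $\zeta_2$, connected in parallel. We have by the above rule:

$$\frac{1}{\zeta'} = \frac{1}{\zeta_1} + \frac{1}{\zeta_2}, \qquad \zeta' = \frac{\zeta_1\zeta_2}{\zeta_1 + \zeta_2}.$$

If we put:

$$\zeta' = \rho e^{\theta i}, \quad \zeta_1 = \rho_1e^{\theta_1i}; \quad \zeta_2 = \rho_2e^{\theta_2i}; \quad \zeta_1 + \zeta_2 = \rho_0e^{\theta_0i},$$

we have:

$$\rho = \frac{\rho_1\rho_2}{\rho_0}; \qquad \theta = \theta_1 + \theta_2 - \theta_0.$$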


This leads us to the following geometrical construction (Fig. 178).†
We first find the sum $\zeta_1 + \zeta_2 = OC$; then we draw $\triangle AOD$, similar to
$\triangle COB$, by turning $\triangle COB$ to the position $C'OB'$ and drawing $AD \parallel C'B'$.
We deduce from the fact that the triangles are similar that:

$$OD = OA\cdot\frac{OB}{OC}, \quad\text{i.e.}\quad \rho = \frac{\rho_1\rho_2}{\rho_0}, \quad \theta = \theta_2 - \theta_0 \quad (\theta_1 = 0),$$

which is what we wanted to show.
3. We consider the coupled oscillations in two magnetically coupled
circuits (Fig. 179). Let $v_1$, $j_1$ denote the external electromotive force and
the current in circuit I, and $j_2$ the current in circuit II (with no external
electromotive force); let $R_1, R_2, L_1, L_2, C_1, C_2$ be respectively the resistances,
inductances and capacities in the circuits, and M the mutual inductance
between the circuits.

† We have simplified the figure by taking the axis OX along the vector
$\zeta_1$, amounting to the assumption that $\theta_1 = 0$. In the general case, it is suf-
ficient to turn OX clockwise through an angle $\theta_1$.

We have the relationships: 


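$$v_1 = R_1j_1 + L_1\frac{dj_1}{dt} + M\frac{dj_2}{dt} + \frac{1}{C_1}\int j_1\,dt,$$

$$0 = R_2j_2 + L_2\frac{dj_2}{dt} + M\frac{dj_1}{dt} + \frac{1}{C_2}\int j_2\,dt.$$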


When we consider the steady state, in which the currents and voltages 
vary sinusoidally at the same frequency, these equations can be written in 
vector form: 


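$$\mathbf{v}_1 = \left(R_1 + \omega L_1i + \frac{1}{\omega C_1i}\right)\mathbf{j}_1 + \omega Mi\,\mathbf{j}_2 = \zeta_1\mathbf{j}_1 + \omega Mi\,\mathbf{j}_2,$$

$$0 = \omega Mi\,\mathbf{j}_1 + \left(R_2 + \omega L_2i + \frac{1}{\omega C_2i}\right)\mathbf{j}_2 = \omega Mi\,\mathbf{j}_1 + \zeta_2\mathbf{j}_2,$$

where $\zeta_1$ and $\zeta_2$ are the impedances of circuits I and II when taken separa-
tely.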
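We can easily solve for $\mathbf{j}_1$ and $\mathbf{j}_2$:

$$\mathbf{j}_1 = \frac{\zeta_2}{\zeta_1\zeta_2 + \omega^2M^2}\,\mathbf{v}_1; \qquad \mathbf{j}_2 = -\frac{\omega Mi}{\zeta_1\zeta_2 + \omega^2M^2}\,\mathbf{v}_1.$$

On re-writing the first equation as:

$$\mathbf{v}_1 = \left(\zeta_1 + \frac{\omega^2M^2}{\zeta_2}\right)\mathbf{j}_1,$$

we can say that the presence of circuit II changes the impedance of circuit
I by an amount $\omega^2M^2/\zeta_2$.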
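
The solution of the pair of vector equations can be verified by substitution with complex arithmetic. The Python sketch below is illustrative; the function name and the numerical values of the impedances, the frequency and the mutual inductance are hypothetical.

```python
def coupled_currents(v1: complex, zeta1: complex, zeta2: complex,
                     omega: float, M: float):
    """Current vectors (j1, j2) of two magnetically coupled circuits,
    with the electromotive force v1 acting in circuit I only."""
    denom = zeta1 * zeta2 + (omega * M) ** 2
    j1 = zeta2 * v1 / denom
    j2 = -1j * omega * M * v1 / denom
    return j1, j2

if __name__ == "__main__":
    # Hypothetical data
    v1, zeta1, zeta2 = 100 + 0j, 10 + 5j, 20 - 8j
    omega, M = 314.0, 0.02
    j1, j2 = coupled_currents(v1, zeta1, zeta2, omega, M)
    # Residuals of the two original equations; both should vanish
    print(v1 - (zeta1 * j1 + 1j * omega * M * j2))
    print(1j * omega * M * j1 + zeta2 * j2)
    # Apparent impedance of circuit I in the presence of circuit II
    print(v1 / j1, zeta1 + (omega * M) ** 2 / zeta2)
```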


182. Curves in the complex form. If we agree to represent real numbers 
by points on a given axis OX, the variation of a real variable is expressed 
by a corresponding displacement of the point on OX. In exactly the same 
way, the variation of a complex variable $\zeta = x + yi$ amounts to displacement
of the corresponding point on the plane XOY.

The case of particular interest is when the point $\zeta$ describes a certain curve
during its variation; this occurs when the real and imaginary parts, i.e. the
coordinates x and y, are functions of some parameter u, which we shall take
to be real:

$$x = \varphi_1(u); \qquad y = \varphi_2(u). \qquad (41)$$

We shall now write simply:

$$\zeta = f(u), \quad\text{where}\quad f(u) = \varphi_1(u) + i\varphi_2(u),$$

and this equation will be referred to as the equation of curve (41) in the com-
plex form.




Equations (41) give the parametric form of the curve in rectangular co-
ordinates. We arrive at the form in polar coordinates by writing $\zeta$ in the
exponential form:

$$\zeta = \rho e^{\theta i}; \qquad \rho = \psi_1(u), \quad \theta = \psi_2(u).$$

The factor $\rho$ in this expression is the same as $|\zeta|$, whilst $e^{\theta i}$, which coin-
cides with the "sign" ($\pm 1$) in the case of real $\zeta$ ($\theta = 0$ or $\pi$), is a vector of
unit length, denoted by the symbol:

$$\operatorname{sgn}\zeta = e^{\theta i} = \frac{\zeta}{|\zeta|}.$$
(sgn is an abbreviation of the Latin word “signum” — sign.) 

The necessity for considering the equations of curves in the complex form 
arises out of vector diagrams. If we take the current vector $\mathbf{j}$ as constant
in the relationship

$$\mathbf{v} = \zeta\mathbf{j},$$

but allow one of the circuit constants to change, the impedance $\zeta$ and
the vector $\mathbf{v}$ will also change; the end of vector $\mathbf{v}$ describes a curve, called
the voltage diagram, and the construction of this gives us a clear picture of
the variation of $\mathbf{v}$. The point $\zeta$ also describes a curve (the impedance diagram),
which differs from the voltage diagram only in the choice of scale (taking
the vector $\mathbf{j}$ as unity).

We now consider the equations of some simple curves. 

1. The equation of a straight line, passing through the point $\zeta_0 = x_0 + y_0i$,
and forming an angle $\alpha$ with OX, is:

$$\zeta = \zeta_0 + ue^{\alpha i};$$

the parameter u here denotes the distance from point $\zeta_0$ to $\zeta$.
2. The equation of a circle with its centre at $\zeta_0$ and radius r is:

$$\zeta = \zeta_0 + re^{ui}.$$


3. An ellipse with its centre at the origin, its major axis along OX, and
semi-axes a and b, has the equation in the complex form [177]:

$$\zeta = x + yi = a\cos u + bi\sin u = \frac{a + b}{2}e^{ui} + \frac{a - b}{2}e^{-ui}.$$

If the major axis makes an angle $\varphi_0$ with OX, the equation of the ellipse
takes the form:

$$\zeta = e^{\varphi_0i}\left[\frac{a + b}{2}e^{ui} + \frac{a - b}{2}e^{-ui}\right].$$

In the general case, when the centre of the ellipse is at $\zeta_0$ and its major
axis makes an angle $\varphi_0$ with OX, its equation is:

$$\zeta = \zeta_0 + e^{\varphi_0i}\left[\frac{a + b}{2}e^{ui} + \frac{a - b}{2}e^{-ui}\right].$$
If b = a, this equation becomes the equation of a circle of radius a:

$$\zeta = \zeta_0 + ae^{(\varphi_0 + u)i},$$

where $(\varphi_0 + u)$ is a real parameter like u.
If b = 0, we get a segment of a straight line:

$$\zeta = \zeta_0 + ae^{\varphi_0i}\,\frac{e^{ui} + e^{-ui}}{2} = \zeta_0 + ae^{\varphi_0i}\cos u; \qquad \zeta = \zeta_0 + ve^{\varphi_0i},$$

making an angle $\varphi_0$ with OX, of length 2a, and with its centre at $\zeta_0$, since
the parameter $v = a\cos u$ is real, like u, but can only take values between
(-a) and (+a).

If we consider the circle and the straight line segment as limiting cases of an 
ellipse, when its minor semi-axis becomes equal to the major, or becomes 
zero, we can say in general that the equation 


$$\zeta = \zeta_0 + \mu_1e^{ui} + \mu_2e^{-ui}, \qquad (42)$$

where $\zeta_0$, $\mu_1$ and $\mu_2$ are any desired complex numbers, always represents the
equation of an ellipse.
This follows by putting:

$$\mu_1 = M_1e^{\theta_1i}; \quad \mu_2 = M_2e^{\theta_2i}; \quad \frac{1}{2}(\theta_1 + \theta_2) = \varphi_0; \quad \frac{1}{2}(\theta_1 - \theta_2) = \theta_0,$$

when (42) can be written in the form:

$$\zeta = \zeta_0 + M_1e^{(u + \theta_1)i} + M_2e^{-(u - \theta_2)i} = \zeta_0 + e^{\varphi_0i}\left[M_1e^{(u + \theta_0)i} + M_2e^{-(u + \theta_0)i}\right],$$

from which it is clear that the curve in question is in fact an ellipse with its
centre at $\zeta_0$, semi-axes $(M_1 \pm M_2)$, and its major axis making an angle $\varphi_0$
with OX, i.e. in a direction bisecting the angle between the vectors $\mu_1$ and $\mu_2$.
The ellipse becomes a circle for $M_2 = 0$, and a segment of a line for $M_1 = M_2$.
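
The semi-axes and the direction of the major axis can be read off from the two complex coefficients and then checked against points of the curve. The Python sketch below is illustrative; the function names and the sample coefficients are assumptions, and the amplitudes returned by `cmath.polar` are the principal ones, which fixes the direction of the axis only up to a half-turn.

```python
import cmath

def ellipse_parameters(mu1: complex, mu2: complex):
    """Semi-axes M1 + M2 and |M1 - M2|, and the inclination of the major axis."""
    M1, theta1 = cmath.polar(mu1)
    M2, theta2 = cmath.polar(mu2)
    return M1 + M2, abs(M1 - M2), 0.5 * (theta1 + theta2)

def curve_point(zeta0: complex, mu1: complex, mu2: complex, u: float) -> complex:
    """Point zeta0 + mu1 e^{ui} + mu2 e^{-ui} of the curve."""
    return zeta0 + mu1 * cmath.exp(1j * u) + mu2 * cmath.exp(-1j * u)

if __name__ == "__main__":
    zeta0 = 1 + 2j
    mu1 = 2 * cmath.exp(0.5j)      # hypothetical coefficients
    mu2 = 1 * cmath.exp(-0.1j)
    a, b, phi0 = ellipse_parameters(mu1, mu2)
    for u in (0.0, 0.7, 2.5, 4.0):
        # Refer the point to axes through the centre, turned through phi0
        w = (curve_point(zeta0, mu1, mu2, u) - zeta0) * cmath.exp(-1j * phi0)
        print((w.real / a) ** 2 + (w.imag / b) ** 2)   # should equal 1
```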

4. Curves of great value in the study of alternating current phenomena
in circuits with continuously distributed resistance, capacity and induc-
tance, are those with equations of the complex form:

$$\zeta = \nu e^{\gamma u}, \qquad (43)$$

where $\nu$ and $\gamma$ are any desired complex constants.
We have here, on putting $\nu = N_0e^{\varphi_0i}$, $\gamma = a + bi$ and using polar co-
ordinates:

$$\zeta = \rho e^{\theta i} = N_0e^{\varphi_0i}e^{(a + bi)u} = N_0e^{au}e^{(bu + \varphi_0)i},$$

that is:

$$\rho = N_0e^{au}, \qquad \theta = bu + \varphi_0,$$

whence

$$u = \frac{\theta - \varphi_0}{b};$$

or finally:

$$\rho = Ne^{\frac{a}{b}\theta} \quad \left(N = N_0e^{-\frac{a\varphi_0}{b}}\right),$$

i.e. the curve in question is a logarithmic spiral (Fig. 180, for the case
a/b > 0).
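More difficult curves of the type:

$$\zeta = \nu_1e^{\gamma_1u} + \nu_2e^{\gamma_2u} + \dots + \nu_se^{\gamma_su}$$

can be obtained by constructing the "component spirals":

$$\zeta_1 = \nu_1e^{\gamma_1u}, \quad \zeta_2 = \nu_2e^{\gamma_2u}, \quad \dots, \quad \zeta_s = \nu_se^{\gamma_su},$$

and working out geometrically, for each value of u, the sum of the corres-
ponding values of $\zeta_1, \zeta_2, \dots, \zeta_s$ (Fig. 181).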
183. Representation of harmonic oscillations in complex form. A damped
harmonic oscillation is expressed by the formula:

$$x = Ae^{-\varepsilon t}\sin(\omega t + \varphi_0), \qquad (44)$$

where A and $\varepsilon$ are positive constants. We introduce the complex quantity
here:

$$\zeta = Ae^{\left(\varphi_0 - \frac{\pi}{2}\right)i}e^{(-\varepsilon + \omega i)t} = Ae^{-\varepsilon t}e^{\left(\omega t + \varphi_0 - \frac{\pi}{2}\right)i}. \qquad (45)$$

The real part of this complex quantity coincides with the expression (44).
We can thus represent any damped harmonic oscillation as the real part
of a complex expression of the form:

$$\zeta = \alpha e^{\beta it},$$

where $\alpha$ and $\beta$ are complex numbers. In the case of (45):

$$\alpha = Ae^{\left(\varphi_0 - \frac{\pi}{2}\right)i} \quad\text{and}\quad \beta = \omega + \varepsilon i.$$

For a pure harmonic oscillation without damping, $\varepsilon = 0$, and $\beta$ becomes
a real number.
Expression (45) for $\zeta$ coincides with (43) when

$$\nu = Ae^{\left(\varphi_0 - \frac{\pi}{2}\right)i}, \quad \gamma = (\omega + \varepsilon i)i = -\varepsilon + \omega i \quad\text{and}\quad u = t.$$

It is clear from this that the point $\zeta$ describes a logarithmic spiral as t
varies, the polar angle $\theta$ being a linear function of time t:

$$\theta = \omega t + \varphi_0 - \frac{\pi}{2},$$

i.e. the radius vector from the origin to $\zeta$ rotates about the origin with
constant angular velocity $\omega$. The projection of $\zeta$ on OX performs the damped
oscillation (44). If $\varepsilon = 0$, $\zeta$ moves on the circle $\rho = A$, and its projection
on OX moves according to the law of harmonic oscillation without damp-
ing:

$$x = A\sin(\omega t + \varphi_0).$$
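
That the real part of the complex expression reproduces the damped oscillation is readily confirmed numerically. The Python sketch below is illustrative; the function names and the sample constants are our own assumptions.

```python
import cmath
import math

def zeta(t: float, A: float, eps: float, omega: float, phi0: float) -> complex:
    """Complex quantity alpha * exp(beta * i * t), with
    alpha = A exp((phi0 - pi/2) i) and beta = omega + eps i."""
    alpha = A * cmath.exp(1j * (phi0 - math.pi / 2.0))
    beta = complex(omega, eps)
    return alpha * cmath.exp(1j * beta * t)

def damped_oscillation(t: float, A: float, eps: float, omega: float, phi0: float) -> float:
    """x = A exp(-eps t) sin(omega t + phi0)."""
    return A * math.exp(-eps * t) * math.sin(omega * t + phi0)

if __name__ == "__main__":
    A, eps, omega, phi0 = 2.0, 0.3, 5.0, 0.8   # hypothetical constants
    for t in (0.0, 0.3, 1.7):
        z = zeta(t, A, eps, omega, phi0)
        print(z.real, damped_oscillation(t, A, eps, omega, phi0), abs(z))
```

The modulus of the complex quantity, printed in the last column, falls off as A exp(-εt), which is the radius vector of the logarithmic spiral.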


§ 18. Basic properties and evaluation
of the zeros of integral polynomials 


184. Algebraic equations. We undertake the study in the present 
article of the integral polynomial: 


$$f(z) = a_0z^{n} + a_1z^{n-1} + \dots + a_kz^{n-k} + \dots + a_{n-1}z + a_n,$$

where $a_0, a_1, \dots, a_k, \dots, a_n$ are given complex numbers, z is a complex
variable, and the initial coefficient $a_0$ can be assumed to differ from
zero. The primary operations with polynomials are familiar from ele-
mentary algebra. We shall only recall the basic result as regards
division. If the degree of a polynomial $\varphi(z)$ is not higher than the degree
of another polynomial f(z), f(z) can be expressed in the form:

$$f(z) = \varphi(z)\cdot Q(z) + R(z),$$

where Q(z) and R(z) are also polynomials, the degree of R(z) being
lower than that of $\varphi(z)$. We speak of Q(z) and R(z) respectively as the
quotient and remainder of the division of f(z) by $\varphi(z)$. The quotient
and remainder are completely determinate polynomials, so that f(z)
is expressed uniquely in terms of $\varphi(z)$ in the above form.

The zeros of the polynomial are defined as the values of z which, when 
substituted in the polynomial, cause it to vanish. The zeros of f(z) are 
thus the roots of the equation: 


$$f(z) = a_0z^{n} + a_1z^{n-1} + \dots + a_kz^{n-k} + \dots + a_{n-1}z + a_n = 0. \qquad (1)$$

The equation written is called an algebraic equation of the nth
degree.

If f(z) is divided by (z — a), the quotient Q(z) is a polynomial of 
degree (n - 1) with initial coefficient $a_0$, whilst the remainder R will
not contain z. The identity follows from the basic definition of division:

$$f(z) = (z - a)\,Q(z) + R.$$

We get on substituting z = a in this identity:

$$R = f(a),$$
i.e. division of a polynomial f(z) by (z — a) gives a remainder equal 
to f(a) (Bézout’s theorem). 
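
Division by a linear factor is conveniently carried out by the scheme usually called synthetic division, or Horner's scheme, which yields the quotient and the remainder together. The Python sketch below is an illustration; the function name is an assumption of ours, and the scheme is a standard technique rather than one taken from the text.

```python
def divide_by_linear(coeffs, a):
    """Divide a0 z^n + a1 z^(n-1) + ... + an by (z - a).

    coeffs lists a0, a1, ..., an; the result is (quotient coefficients, remainder).
    The remainder equals f(a), in accordance with Bezout's theorem."""
    values = []
    acc = 0
    for c in coeffs:
        acc = acc * a + c       # b_k = a * b_(k-1) + a_k
        values.append(acc)
    return values[:-1], values[-1]

if __name__ == "__main__":
    f = [1, -6, 11, -6]                  # z^3 - 6z^2 + 11z - 6
    print(divide_by_linear(f, 2))        # ([1, -4, 3], 0): z = 2 is a zero
    print(divide_by_linear(f, 4))        # ([1, -2, 3], 6): the remainder is f(4)
    print(divide_by_linear(f, 1 + 1j))   # complex arguments work equally well
```

The initial coefficient of the quotient is the same as that of f(z), as stated above.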


In particular, a necessary and sufficient condition that f(z) be divi-
sible by (z - a) with no remainder is that

$$f(a) = 0,$$

i.e. the necessary and sufficient condition for the polynomial to be divis-
ible without remainder by the linear factor (z - a) is that z = a be a
zero of the polynomial.


Thus, knowing that z = a is a zero of f(z), we can divide the poly-
nomial by the factor (z - a):

$$f(z) = (z - a)\,f_1(z),$$

where

$$f_1(z) = b_0z^{n-1} + b_1z^{n-2} + \dots + b_{n-2}z + b_{n-1} \quad (b_0 = a_0);$$

finding the remaining zeros leads to the solution of the equation:

$$b_0z^{n-1} + b_1z^{n-2} + \dots + b_{n-2}z + b_{n-1} = 0$$


of degree (n — 1). 

Before proceeding, we have to answer the question: does every 
algebraic equation possess roots? For non-algebraic equations the answer 
can be in the negative; for instance, the equation 

$$e^z = 0 \qquad (z = x + iy)$$

never has a root, since the modulus e^x of the left-hand side does not 
vanish for any x. In the case of algebraic equations, however, our 
question has an affirmative answer, contained in the following basic 
theorem of algebra: every algebraic equation has at least one real or 
complex root. 

We state this theorem here without proof. The proof will be found 
in the third volume, on the theory of functions of a complex variable. 

185. Factorization of polynomials. In accordance with the basic 
theorem, every polynomial 


$$f(z) = a_0 z^n + a_1 z^{n-1} + \dots + a_{n-1} z + a_n \qquad (2)$$

has a zero z = z_1; it can thus be divided by (z - z_1) and written [184]: 

$$f(z) = (z - z_1)(a_0 z^{n-1} + \dots).$$

The second factor on the right-hand side of this equation has a 
root z = z_2, by the same theorem, and can be divided by (z - z_2); 
so that we can now write: 

$$f(z) = (z - z_1)(z - z_2)(a_0 z^{n-2} + \dots).$$


On continuing to divide out the linear factors, we finally obtain the 
factorial form of f(z): 


$$f(z) = a_0 (z - z_1)(z - z_2) \dots (z - z_n), \qquad (3)$$


i.e. every polynomial of the n-th degree can be split into (n + 1) factors, 
one of which is the initial coefficient, whilst the remainder are linear 
factors of the form (z - a). 

At least one of the factors of (3) vanishes on substituting z = z_s 
(s = 1, 2, ..., n), so that these z = z_s are zeros of f(z). 

A z differing from all the z_s cannot be a zero of f(z), since none of 
the factors of (3) vanishes for this z. 

If all the z_s are different, f(z) has precisely n different zeros; if some 
z_s are the same, the number of different zeros of f(z) is less than n. 

We can thus state the theorem: a polynomial of degree n (or algebraic 
equation of degree n) cannot have more than n different zeros. 

An immediate consequence of this theorem is the proposition: 
if a polynomial of degree not higher than n is known to have more 
than n different zeros, all its coefficients and its free term must be zero, 
i.e. the polynomial is identically zero. 

Suppose that two polynomials f_1(z) and f_2(z), of degrees not higher 
than n, have the same value for more than n different values of z. 
Their difference, f_1(z) - f_2(z), is a polynomial of degree not higher 
than n, with more than n different roots, and therefore identically 
zero; f_1(z) and f_2(z) thus have the same coefficients. If two polynomials, 
of degrees not higher than n, have the same values for more than n different 
values of z, they have the same coefficients and the same absolute term, 
i.e. they are identical. 

This property of polynomials lies at the basis of the so-called method 
of undetermined coefficients, which we shall make use of later. The 
essence of the method consists in the fact that the identity of two 
polynomials implies that they have the same coefficients for the same 
powers of z. 

We obtained the factor form (3) by dividing the polynomial f(z) 
by linear factors in a definite order. We now show that the final form 
is independent of the way in which we divide out the factors, i.e. that 
a polynomial has a unique factor form (3). 

Suppose that, in addition to (3), we have the factor form: 

$$f(z) = b_0 (z - z_1')(z - z_2') \dots (z - z_n'). \qquad (3_1)$$

On comparing these two forms, we can write the identity: 

$$a_0 (z - z_1)(z - z_2) \dots (z - z_n) = b_0 (z - z_1')(z - z_2') \dots (z - z_n').$$

The left-hand side of this identity vanishes for z = z_1, and the same 
must therefore be true of the right-hand side, i.e. at least one of the 
z_k' must be equal to z_1. We can take, for example, z_1' = z_1. We cancel 
(z - z_1) from both sides and get the equation: 

$$a_0 (z - z_2) \dots (z - z_n) = b_0 (z - z_2') \dots (z - z_n').$$

This is valid for all z, except possibly z = z_1. But this equation must 
now also be an identity, by the proposition proved above. We can 
use the same argument as above to show that z_2' = z_2, and so on, and 
finally, that b_0 = a_0, i.e. the form (3_1) must coincide with (3). 


186. Multiple zeros. Some of the z_s appearing in (3) may be the 
same, as already mentioned. We can put the same factors together in 
groups, and write: 

$$f(z) = a_0 (z - z_1)^{k_1} (z - z_2)^{k_2} \dots (z - z_m)^{k_m}, \qquad (4)$$

where z_1, z_2, ..., z_m are different, and 

$$k_1 + k_2 + \dots + k_m = n. \qquad (5)$$

If there is a factor (z - z_s)^{k_s} in the factor form written, z = z_s 
is called a zero of order k_s, and in general, the zero z = a of the 
polynomial f(z) is called a zero of order k, if f(z) is divisible by 
(z - a)^k, but not by (z - a)^{k+1}. 

We now give another test for multiple (repeated) zeros. We intro- 
duce Taylor’s theorem here, and first of all remark that the derivatives 
of the polynomial f(z) can be defined by means of the same formulae 
as were valid in the case of a real variable: 


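$$f(z) = a_0 z^n + a_1 z^{n-1} + \dots + a_k z^{n-k} + \dots + a_{n-1} z + a_n;$$
$$f'(z) = n a_0 z^{n-1} + (n-1) a_1 z^{n-2} + \dots + (n-k) a_k z^{n-k-1} + \dots + a_{n-1};$$
$$f''(z) = n(n-1) a_0 z^{n-2} + (n-1)(n-2) a_1 z^{n-3} + \dots + (n-k)(n-k-1) a_k z^{n-k-2} + \dots + 2 \cdot 1\, a_{n-2};$$


Taylor's formula: 

$$f(z) = f(a) + \frac{z-a}{1!} f'(a) + \frac{(z-a)^2}{2!} f''(a) + \dots + \frac{(z-a)^k}{k!} f^{(k)}(a) + \dots + \frac{(z-a)^n}{n!} f^{(n)}(a) \qquad (6)$$

consists of an elementary algebraic identity in a and z, valid for 
complex, as well as real, values of these symbols. 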

We now deduce the condition for z = a to be a zero of order k 
of f(z). We write (6) in the form: 

$$f(z) = (z-a)^k \left[ \frac{f^{(k)}(a)}{k!} + \frac{z-a}{(k+1)!} f^{(k+1)}(a) + \dots + \frac{(z-a)^{n-k}}{n!} f^{(n)}(a) \right] + \left[ f(a) + \frac{z-a}{1!} f'(a) + \dots + \frac{(z-a)^{k-1}}{(k-1)!} f^{(k-1)}(a) \right].$$

The second polynomial in square brackets on the right is of lower 
degree than (z - a)^k, so that clearly [184], the first square bracket 
is the quotient, and the second the remainder, of the division of f(z) 
by (z - a)^k. The necessary and sufficient condition for f(z) to be 
divisible by (z - a)^k is that this remainder should be identically zero. 
We consider it as a polynomial in (z - a), and get the following 
conditions: 

$$f(a) = f'(a) = \dots = f^{(k-1)}(a) = 0. \qquad (7)$$

We must add to these the condition 

$$f^{(k)}(a) \neq 0, \qquad (8)$$

since if f^{(k)}(a) = 0, f(z) is divisible by (z - a)^{k+1} as well as by (z - a)^k. 
All in all, (7) and (8) give the necessary and sufficient conditions for 
z = a to be a zero of order k of f(z). 
We put ψ(z) = f'(z), so that: 

$$\psi'(z) = f''(z); \quad \dots; \quad \psi^{(k-1)}(z) = f^{(k)}(z).$$


If z = a is a zero of order k of f(z), we have by (7) and (8): 

$$\psi(a) = \psi'(a) = \dots = \psi^{(k-2)}(a) = 0 \quad \text{and} \quad \psi^{(k-1)}(a) \neq 0,$$

i.e. z = a is a zero of order (k - 1) of ψ(z) or, what amounts to the 
same thing, of f'(z), i.e. a zero of order k of a given polynomial is a 
zero of order (k - 1) of its first derivative. It can be seen by using this 
property successively that it is a zero of order (k - 2) of the second 
derivative, a zero of order (k - 3) of the third derivative, etc., and 
finally, a single, or simple, root of the (k - 1)-th derivative. 

Thus, if f(z) has the factor form: 

$$f(z) = a_0 (z - z_1)^{k_1} (z - z_2)^{k_2} \dots (z - z_m)^{k_m}, \qquad (9)$$

f'(z) will have the form: 

$$f'(z) = (z - z_1)^{k_1 - 1} (z - z_2)^{k_2 - 1} \dots (z - z_m)^{k_m - 1}\, \omega(z), \qquad (10)$$

where ω(z) is a polynomial with no zeros in common with f(z). 


187. Horner's rule. We now give a practical rule for the evaluation of 
f(z) and its derivatives for a given z = a. 

Let the division of f(z) by (z - a) give a quotient f_1(z) and a remainder 
r_1; let the division of f_1(z) by (z - a) give a quotient f_2(z) and a remainder 
r_2, and so on: 

$$f(z) = (z - a) f_1(z) + r_1; \qquad r_1 = f(a);$$
$$f_1(z) = (z - a) f_2(z) + r_2; \qquad r_2 = f_1(a);$$
$$f_2(z) = (z - a) f_3(z) + r_3; \qquad r_3 = f_2(a);$$
$$\dots$$


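We write (6) in the form: 

$$f(z) = f(a) + (z - a) \left[ \frac{f'(a)}{1!} + \frac{f''(a)}{2!} (z - a) + \dots + \frac{f^{(n)}(a)}{n!} (z - a)^{n-1} \right].$$

We compare this formula with the first of the equations written above, 
and obtain: 

$$f_1(z) = \frac{f'(a)}{1!} + \frac{f''(a)}{2!} (z - a) + \dots + \frac{f^{(n)}(a)}{n!} (z - a)^{n-1}; \qquad r_1 = f(a).$$

We find, on treating f_1(z) in the same way: 

$$f_2(z) = \frac{f''(a)}{2!} + \frac{f'''(a)}{3!} (z - a) + \dots + \frac{f^{(n)}(a)}{n!} (z - a)^{n-2}; \qquad r_2 = \frac{f'(a)}{1!},$$

and in general: 

$$r_k = \frac{f^{(k-1)}(a)}{(k-1)!} \qquad (k = 1, 2, \dots, n).$$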


We now put: 

$$f(z) = a_0 z^n + a_1 z^{n-1} + \dots + a_{n-1} z + a_n$$
$$f_1(z) = b_0 z^{n-1} + b_1 z^{n-2} + \dots + b_{n-2} z + b_{n-1}; \qquad b_n = r_1$$

and show how the coefficients b_s of the quotient, and b_n of the remainder, 
can be evaluated. We remove the brackets and collect terms in the same 
power of z, and obtain the identity: 

$$a_0 z^n + a_1 z^{n-1} + \dots + a_{n-1} z + a_n = (z - a)(b_0 z^{n-1} + b_1 z^{n-2} + \dots + b_{n-2} z + b_{n-1}) + b_n$$
$$= b_0 z^n + (b_1 - b_0 a) z^{n-1} + (b_2 - b_1 a) z^{n-2} + \dots + (b_{n-1} - b_{n-2} a) z + (b_n - b_{n-1} a),$$

and comparison of the coefficients of like powers of z gives: 

$$a_0 = b_0; \quad a_1 = b_1 - b_0 a; \quad a_2 = b_2 - b_1 a; \quad \dots; \quad a_{n-1} = b_{n-1} - b_{n-2} a; \quad a_n = b_n - b_{n-1} a,$$

whence: 

$$b_0 = a_0; \quad b_1 = b_0 a + a_1; \quad b_2 = b_1 a + a_2; \quad \dots; \quad b_{n-1} = b_{n-2} a + a_{n-1}; \quad b_n = b_{n-1} a + a_n = r_1.$$

These equations enable us successively to calculate the b_s. 
Similarly, on writing the quotient and remainder of the division of f_1(z) 
by (z - a) as: 

$$f_2(z) = c_0 z^{n-2} + c_1 z^{n-3} + \dots + c_{n-3} z + c_{n-2}; \qquad c_{n-1} = r_2,$$

we obtain: 

$$c_0 = b_0; \quad c_1 = c_0 a + b_1; \quad c_2 = c_1 a + b_2; \quad \dots; \quad c_{n-2} = c_{n-3} a + b_{n-2}; \quad c_{n-1} = c_{n-2} a + b_{n-1} = r_2,$$

i.e. coefficients c_s are calculated successively with the aid of the b_s, just as 
the b_s were calculated with the aid of the a_s. 

This method of calculation is called Horner's rule. 

We obtain the quantities f^{(k)}(a)/k! on applying this rule. 
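
The repeated division can be sketched as a short routine. This is a minimal illustration, assuming coefficients listed from the highest power down; the name `horner_taylor` is ours. Each pass performs the recurrence b_0 = a_0, b_s = b_{s-1} a + a_s, keeps the last number as the remainder, and divides the quotient again.

```python
def horner_taylor(coeffs, a):
    """Return [f(a), f'(a)/1!, f''(a)/2!, ..., f^(n)(a)/n!].

    coeffs lists a_0, a_1, ..., a_n from the highest power down.
    Each pass is one division by (z - a): b_0 = a_0,
    b_s = b_{s-1}*a + a_s, and the last b is the remainder.
    """
    row = list(coeffs)
    result = []
    while row:
        for s in range(1, len(row)):
            row[s] += row[s - 1] * a
        result.append(row.pop())   # remainder r_k of this pass
        # what is left in `row` is the quotient, divided again next pass
    return result

# f(z) = z^5 + 2z^4 - 2z^2 - 25z + 100 at a = -5
print(horner_taylor([1, 2, 0, -2, -25, 100], -5))
# -> [-1700, 2120, -952, 210, -23, 1]
```

Multiplying the k-th entry by k! recovers the derivative itself, e.g. f''(-5) = 2 * (-952) = -1904.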


The calculation is set out in the table below, which is self-explanatory. 

a |  a_0,        a_1,     a_2,    ...,  a_{n-2},    a_{n-1},    a_n 
  |  +           b_0 a,   b_1 a,  ...,  b_{n-3} a,  b_{n-2} a,  b_{n-1} a 
  |  b_0 = a_0,  b_1,     b_2,    ...,  b_{n-2},    b_{n-1},    b_n = r_1 = f(a) 
  |  +           c_0 a,   c_1 a,  ...,  c_{n-3} a,  c_{n-2} a 
  |  c_0 = a_0,  c_1,     c_2,    ...,  c_{n-2},    c_{n-1} = r_2 = f'(a)/1! 
  |  ...                                d_{n-2} = r_3 = f''(a)/2! 
  |  ... 
  |  +           l_0 a 
  |  l_0 = a_0,  l_1 = r_n = f^{(n-1)}(a)/(n-1)! 
  |  m_0 = a_0 = r_{n+1} = f^{(n)}(a)/n! 





Example. To find the values of 

$$f(z) = z^5 + 2z^4 - 2z^2 - 25z + 100$$

and its derivatives for z = -5. 

a = -5 |  1,    2,    0,    -2,   -25,    100 
       |       -5,   15,   -75,   385,  -1800 
       |  1,   -3,   15,   -77,   360,  -1700 = f(-5) 
       |       -5,   40,  -275,  1760 
       |  1,   -8,   55,  -352,  2120 = f'(-5)/1! 
       |       -5,   65,  -600 
       |  1,  -13,  120,  -952 = f''(-5)/2! 
       |       -5,   90 
       |  1,  -18,  210 = f'''(-5)/3! 
       |       -5 
       |  1,  -23 = f^{(iv)}(-5)/4! 
       |  1 = f^{(v)}(-5)/5! 


188. Highest common factor. We take two polynomials f_1(z) and f_2(z), 
each of which must have a determinate factor form (3). The 
highest common factor of the two polynomials is defined as the product 
of all the linear factors of the type (z - a) that appear in the 
expansions of both f_1(z) and f_2(z), each individual factor being raised to the 
lower of the powers to which it is raised in the expansions concerned. 
Constant factors have no part in the composition of the highest 
common factor. The highest common factor of two polynomials is a third 
polynomial, the zeros of which are in common with those of the first 
two, each zero having an order equal to the lower of the orders with 
which it appears in the two original polynomials. If the two original 
polynomials have no common zeros they are said to be relatively 
prime. The highest common factor of several polynomials can be 
defined in precisely the same way. 

A knowledge of the expansion into linear factors of the given 
polynomials is essential, if the highest common factor is to be obtained 
by the above method. Yet obtaining the expansion (3) amounts to 
solving the equation f(z) = 0, and this represents one of the basic 
problems of algebra. 

The highest common factor can be obtained by another method, 
however, similar to that used in arithmetic for finding the highest 
common factor of integers; this is the method of successive division 
[Euclid's algorithm], which does not require factorization. Let the 
degree of f_1(z) be not lower than that of f_2(z). We divide the first 
polynomial by the second, then the second, f_2(z), by the remainder from 
the first division, then this first remainder by the remainder from 
the second division, and so on, until we finally reach a division with 
a remainder equal to zero. The last non-zero remainder is the highest 
common factor of the two given polynomials. If this remainder does not 
contain z, the given polynomials are relatively prime. Thus, finding 
the highest common factor amounts to division of the polynomials, 
arranged in decreasing powers of the variable. Division of f_1(z) and f_2(z) 
by D(z), their highest common factor, gives us relatively prime 
polynomials, one or both of which may not contain z. 
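
The method of successive division can be sketched as follows. This is an illustration only: the names `poly_rem` and `poly_hcf` are ours, coefficients are listed from the highest power down, exact fractions are used, and the result is divided by its leading coefficient, since constant factors play no part in the highest common factor.

```python
from fractions import Fraction

def poly_rem(f, g):
    """Remainder of f divided by g (coefficients from the highest power down)."""
    r = [Fraction(c) for c in f]
    while len(r) >= len(g):
        factor = r[0] / g[0]
        for i, c in enumerate(g):
            r[i] -= factor * c
        r.pop(0)
    # strip leading zeros so that len(r) - 1 is the true degree
    while r and r[0] == 0:
        r.pop(0)
    return r

def poly_hcf(f1, f2):
    """Highest common factor by successive division, with leading coefficient 1."""
    a = [Fraction(c) for c in f1]
    b = [Fraction(c) for c in f2]
    while b:                      # stop when a division leaves remainder zero
        a, b = b, poly_rem(a, b)
    return [c / a[0] for c in a]  # a is the last non-zero remainder

# (z - 1)^2 (z + 2) and (z - 1)(z + 3) share only the factor (z - 1)
print(poly_hcf([1, 0, -3, 2], [1, 2, -3]))
```

A result of degree zero, such as the one obtained for z^2 + 1 and z - 1, signals relatively prime polynomials.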

We see by comparing expansions (9) and (10) that the highest 
common factor of f(z) and its derivative f'(z) is: 

$$D(z) = (z - z_1)^{k_1 - 1} (z - z_2)^{k_2 - 1} \dots (z - z_m)^{k_m - 1},$$

constant factors being omitted as of no account. 
We obtain on dividing f(z) by D(z): 

$$\frac{f(z)}{D(z)} = a_0 (z - z_1)(z - z_2) \dots (z - z_m),$$

i.e. division of a polynomial f(z) by the highest common factor of f(z) 
and f'(z) gives a polynomial, all the zeros of which are simple, and 
coincident with the zeros of f(z). 

The operation of obtaining this polynomial is referred to as ridding 
f(z) of multiple zeros. We see that this is achieved without needing 
to solve the equation f(z) = 0. 

If f(z) and f'(z) are relatively prime, all the zeros of f(z) are simple, 
and conversely. 
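
Ridding a polynomial of multiple zeros can be sketched in the same spirit. The helper names below are ours, coefficients run from the highest power down, exact fractions are used, and the quotient is normalised to leading coefficient 1 (an assumption made only to fix the arbitrary constant factor).

```python
from fractions import Fraction

def divmod_poly(f, g):
    """Quotient and remainder of f / g, coefficients from the highest power down."""
    r = [Fraction(c) for c in f]
    q = []
    while len(r) >= len(g):
        factor = r[0] / g[0]
        q.append(factor)
        for i, c in enumerate(g):
            r[i] -= factor * c
        r.pop(0)
    while r and r[0] == 0:   # drop leading zeros of the remainder
        r.pop(0)
    return q, r

def derivative(f):
    n = len(f) - 1
    return [c * (n - i) for i, c in enumerate(f[:-1])]

def simple_zero_part(f):
    """f(z) / D(z), where D is the highest common factor of f and f'."""
    a, b = list(f), derivative(f)
    while b:                 # Euclid's algorithm
        a, b = b, divmod_poly(a, b)[1]
    quotient, remainder = divmod_poly(f, a)
    assert not remainder     # D(z) divides f(z) exactly
    return [c / quotient[0] for c in quotient]

# f(z) = (z - 1)^3 (z + 2)^2  ->  (z - 1)(z + 2) = z^2 + z - 2
print(simple_zero_part([1, 1, -5, -1, 8, -4]))
```

No zero of f(z) is ever computed; only divisions of polynomials are carried out.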


189. Real polynomials. We now consider the polynomial with real 
coefficients: 


$$f(z) = a_0 z^n + a_1 z^{n-1} + \dots + a_{n-1} z + a_n,$$

and we suppose that it has a complex zero z = a + bi (b ≠ 0) of 
order of multiplicity k, i.e. 

$$f(a + bi) = f'(a + bi) = \dots = f^{(k-1)}(a + bi) = 0;$$
$$f^{(k)}(a + bi) = A + Bi \neq 0.$$

We now replace all the quantities in f(a + bi) and its derivatives 
by their conjugates. The real coefficients a_s will remain unchanged in 
this process, whilst (a + bi) becomes (a - bi), i.e. the polynomial 
f(z) remains as before, except that its argument z = a + bi is now replaced by 
z = a - bi. We know from [173] that the total effect of replacing 
complex numbers by their conjugates is to obtain the conjugate of 
their result, i.e. we get the conjugate of the polynomial. Thus: 

$$f(a - bi) = f'(a - bi) = \dots = f^{(k-1)}(a - bi) = 0;$$
$$f^{(k)}(a - bi) = A - Bi \neq 0,$$

i.e. if a polynomial with real coefficients has a complex zero z = a + bi 
of order of multiplicity k, it must have the conjugate zero z = a - bi 
of the same multiplicity. 

The complex zeros of a polynomial with real coefficients are thus 
distributed in pairs of conjugate zeros. Suppose that the variable z 
only takes real values, and let these be denoted by x. In accordance 
with (3): 





$$f(x) = a_0 (x - z_1)(x - z_2) \dots (x - z_n).$$

If there are imaginaries among the zeros z_s, the corresponding 
factors will also be imaginary. Multiplication of pairs of factors 
corresponding to pairs of conjugate zeros gives: 

$$[x - (a + bi)][x - (a - bi)] = [(x - a) - bi][(x - a) + bi] = (x - a)^2 + b^2 = x^2 + px + q,$$

where 

$$p = -2a; \qquad q = a^2 + b^2 \qquad (b \neq 0).$$

Pairs of conjugate zeros give real factors of the second degree, so 
that we can state the proposition: a polynomial with real coefficients 
can be factored into real factors of the first and second degrees. 

This expansion has the form: 

$$f(x) = a_0 (x - x_1)^{k_1} (x - x_2)^{k_2} \dots (x - x_r)^{k_r} (x^2 + p_1 x + q_1)^{l_1} (x^2 + p_2 x + q_2)^{l_2} \dots (x^2 + p_t x + q_t)^{l_t}, \qquad (11)$$

where x_1, x_2, ..., x_r are real zeros of f(x) of multiplicities k_1, k_2, ..., k_r, 
and the factors of the second degree result from pairs of conjugate 
complex zeros of multiplicities l_1, l_2, ..., l_t. 


190. The relationship between the roots of an equation and its 
coefficients. Let the roots of the equation: 


$$a_0 z^n + a_1 z^{n-1} + \dots + a_{n-1} z + a_n = 0$$

be z_1, z_2, ..., z_n as before. 
By (3), we have the identity: 

$$a_0 z^n + a_1 z^{n-1} + \dots + a_{n-1} z + a_n = a_0 (z - z_1)(z - z_2) \dots (z - z_n).$$

We can apply to the right-hand side the familiar rule of elementary 
algebra for multiplying binomials that differ in their second terms, 
and write our identity as: 

$$a_0 z^n + a_1 z^{n-1} + \dots + a_k z^{n-k} + \dots + a_n = a_0 \left[ z^n - S_1 z^{n-1} + S_2 z^{n-2} + \dots + (-1)^k S_k z^{n-k} + \dots + (-1)^n S_n \right],$$

where S_k denotes the sum of all the possible products obtained by 
taking k of the z_s together (s = 1, 2, ..., n). We find on comparing 
coefficients of the same powers of z: 

$$S_1 = -\frac{a_1}{a_0}; \quad S_2 = \frac{a_2}{a_0}; \quad \dots; \quad S_k = (-1)^k \frac{a_k}{a_0}; \quad \dots; \quad S_n = (-1)^n \frac{a_n}{a_0},$$

or in explicit form: 

$$z_1 + z_2 + \dots + z_n = -\frac{a_1}{a_0},$$
$$z_1 z_2 + z_2 z_3 + \dots + z_{n-1} z_n = \frac{a_2}{a_0}, \qquad (12)$$
$$\dots$$
$$z_1 z_2 \dots z_n = (-1)^n \frac{a_n}{a_0}.$$


These formulae are a generalization of the familiar properties of the 
roots of quadratic equations to the case of equations of any degree. 
They enable us, incidentally, to construct an equation, given its 
roots. 
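
Constructing an equation from its roots can be sketched directly from these formulae. The function name `coefficients_from_roots` and the optional initial coefficient `a0` are our own choices for illustration; each coefficient is formed as a_k = (-1)^k a_0 S_k.

```python
from itertools import combinations
from math import prod

def coefficients_from_roots(roots, a0=1):
    """Return [a_0, a_1, ..., a_n] with a_k = (-1)^k * a_0 * S_k."""
    coeffs = [a0]
    for k in range(1, len(roots) + 1):
        # S_k: sum of all products of k of the roots
        s_k = sum(prod(combo) for combo in combinations(roots, k))
        coeffs.append((-1) ** k * a0 * s_k)
    return coeffs

# roots 1, 2, 3 -> z^3 - 6z^2 + 11z - 6
print(coefficients_from_roots([1, 2, 3]))   # -> [1, -6, 11, -6]
```

A repeated root is simply listed as many times as its order, e.g. the roots 2, 2 give z^2 - 4z + 4.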


191. Equations of the third degree. We shall not pay detailed attention 
to the actual calculation of the roots of algebraic equations. This subject 
is dealt with in text-books on approximate evaluation. We dwell only on 
the case of equations of the third degree, as well as sketching some methods 
of calculation that will be useful in future. 

We start with an investigation of the third degree equation: 

$$y^3 + a_1 y^2 + a_2 y + a_3 = 0. \qquad (13)$$

We introduce a new variable x instead of y, putting 

$$y = x + \alpha.$$

Substitution of this in the left-hand side of (13) gives us: 

$$x^3 + (3\alpha + a_1) x^2 + (3\alpha^2 + 2 a_1 \alpha + a_2) x + (\alpha^3 + a_1 \alpha^2 + a_2 \alpha + a_3) = 0.$$

If we put α = -(a_1/3), the term in x^2 falls out, so that the substitution 

$$y = x - \frac{a_1}{3}$$

changes (13) to the form: 

$$f(x) = x^3 + px + q = 0, \qquad (14)$$

which does not contain x^2. 

If p and q are real, (14) must have either three real roots, or one real and 
two conjugate complex roots [189]. We decide which of these cases is 
in question by taking the first derivative of the left-hand side of the equation: 

$$f'(x) = 3x^2 + p.$$

If p > 0, f'(x) > 0, and f(x) is always increasing; it will only have one 
real root, since its sign changes from (-) to (+) on passing from x = -∞ 
to x = +∞. Now take p < 0. It can easily be seen that f(x) has a maximum 
at x = -√(-p/3) and a minimum at x = +√(-p/3). Substitution of these 
values of x in the expression for f(x) gives us the respective maximum and 
minimum values of the function: 

$$q \mp \frac{2p}{3} \sqrt{-\frac{p}{3}}.$$

If both these values have the same sign, i.e. 

$$\left( q - \frac{2p}{3} \sqrt{-\frac{p}{3}} \right) \left( q + \frac{2p}{3} \sqrt{-\frac{p}{3}} \right) = q^2 + \frac{4p^3}{27} > 0,$$

or 

$$\frac{q^2}{4} + \frac{p^3}{27} > 0, \qquad (15_1)$$

the equation has only one real root, lying in either the interval 

$$\left( -\infty,\ -\sqrt{-\frac{p}{3}} \right) \quad \text{or} \quad \left( \sqrt{-\frac{p}{3}},\ +\infty \right).$$

On the other hand, if the above maximum value of f(x) is (+) and the 
minimum is (-), i.e. 

$$\frac{q^2}{4} + \frac{p^3}{27} < 0, \qquad (15_2)$$

the signs of f(-∞), f(-√(-p/3)), f(+√(-p/3)), f(+∞) are respectively (-), 
(+), (-), (+), and (14) has three real roots. We further remark that condition 
(15_1) is certainly satisfied when p > 0. We leave it to the reader to show 
that, for 

$$\frac{q^2}{4} + \frac{p^3}{27} = 0, \qquad (15_3)$$

(14) has a multiple root ±√(-p/3) and a root 3q/p, taking p ≠ 0, and so, 
from (15_3), p < 0. We have inequality (15_1) when p = 0 and q ≠ 0, and 
equation (14) takes the form x^3 + q = 0, i.e. x = ∛(-q), whence it follows 
that (14) has one real root [175]. With p = q = 0, (14) becomes: x^3 = 0, 
and has the root of order three x = 0. 
The results obtained are collected in the following table: 

                  x^3 + px + q = 0 

q^2/4 + p^3/27 > 0    One real and two conjugate complex roots 
q^2/4 + p^3/27 < 0    Three real and distinct roots 
q^2/4 + p^3/27 = 0    Three real roots, one of which is repeated 

Figure 182 shows the graphs of the function 

$$y = x^3 + px + q$$

for the various assumptions regarding (q^2/4 + p^3/27). The double root in 
case (15_3) corresponds to the point of contact of the curve with OX. 
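
The table can be turned into a small test on the coefficients. This is a sketch under our own assumptions: the name `classify_cubic` is hypothetical, and p and q are taken as integers or fractions so that the sign of q^2/4 + p^3/27 is decided exactly (with floating-point input the case of equality could not be tested reliably).

```python
from fractions import Fraction

def classify_cubic(p, q):
    """Nature of the roots of x^3 + p*x + q = 0 for rational p, q."""
    p, q = Fraction(p), Fraction(q)
    delta = q * q / 4 + p ** 3 / 27
    if delta > 0:
        return "one real and two conjugate complex roots"
    if delta < 0:
        return "three real and distinct roots"
    if p == 0:            # then q = 0 as well, and the equation is x^3 = 0
        return "one real root of order three"
    return "three real roots, one of which is repeated"

print(classify_cubic(-3, 2))   # x^3 - 3x + 2 = (x - 1)^2 (x + 2)
```

The separate branch for p = q = 0 reflects the remark made above that the root x = 0 is then of order three.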

We now deduce a formula expressing the roots of equation (14) in terms 
of its coefficients. Whilst this formula is not convenient for practical 
calculations, a suitable practical method is derived from it in the next article, 
making use of trigonometric functions. 
We introduce two new unknowns u and v in place of x, putting 

$$x = u + v. \qquad (16)$$

We substitute in (14): 

$$(u + v)^3 + p(u + v) + q = 0,$$

or 

$$u^3 + v^3 + (u + v)(3uv + p) + q = 0. \qquad (17)$$

We subject u and v to the condition: 

$$3uv + p = 0,$$

when (17) gives us: 

$$u^3 + v^3 = -q.$$

The problem thus reduces to the solution of the two equations: 

$$uv = -\frac{p}{3}; \qquad u^3 + v^3 = -q. \qquad (18)$$

On raising both sides of the first equation to the third power, we have: 

$$u^3 v^3 = -\frac{p^3}{27}; \qquad u^3 + v^3 = -q,$$

so that u^3 and v^3 are roots of the quadratic equation: 

$$z^2 + qz - \frac{p^3}{27} = 0,$$

i.e. 

$$u = \sqrt[3]{-\frac{q}{2} + \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}}; \qquad v = \sqrt[3]{-\frac{q}{2} - \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}}. \qquad (19)$$

We finally obtain, from (16): 

$$x = \sqrt[3]{-\frac{q}{2} + \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}} + \sqrt[3]{-\frac{q}{2} - \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}}. \qquad (20)$$

This formula for solving the cubic equation (14) is named after Cardan, 
a sixteenth century Italian mathematician. 
We use R_1 and R_2 to denote, for brevity, the expressions under the cube 
root signs in (20): 

$$x = \sqrt[3]{R_1} + \sqrt[3]{R_2}.$$


Each of the cube roots has three distinct values [175], so that the formula 
written gives in general nine distinct values of x, only three of which can be 
roots of (14). The extra values of x are due to the fact that we cubed the 
first of equations (18). We can only consider those values, for which u 
and v are connected by the first of equations (18), i.e. we must only take 
the values of the cube roots whose products are equal to (-p/3). 
We denote by ε one of the values of the cube root of unity: 

$$\varepsilon = \cos\frac{2\pi}{3} + i \sin\frac{2\pi}{3} = -\frac{1}{2} + i\,\frac{\sqrt{3}}{2},$$

and we let ∛R_1 and ∛R_2 denote values of the roots that satisfy the above 
condition. On multiplying these by ε and ε^2 we obtain all three values of 
the root [175]. 

[Fig. 182] 

If we recall that ε^3 = 1, we get the following expressions for the roots 
of equation (14), in which p and q are taken to be any complex numbers: 

$$x_1 = \sqrt[3]{R_1} + \sqrt[3]{R_2}; \qquad x_2 = \varepsilon \sqrt[3]{R_1} + \varepsilon^2 \sqrt[3]{R_2}; \qquad x_3 = \varepsilon^2 \sqrt[3]{R_1} + \varepsilon \sqrt[3]{R_2}. \qquad (21)$$
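
Formulae (21) can be sketched numerically with complex arithmetic. The function name `cardan_roots` is ours, and two implementation choices are assumptions rather than part of the text: the second cube root is obtained from the first through uv = -p/3, which enforces the condition on the products automatically, and the larger in modulus of R_1, R_2 is used to avoid dividing by a number close to zero.

```python
import cmath

def cardan_roots(p, q):
    """Roots x_1, x_2, x_3 of x^3 + p*x + q = 0 by formula (21)."""
    eps = complex(-0.5, 3 ** 0.5 / 2)           # a cube root of unity
    root_term = cmath.sqrt(q * q / 4 + p ** 3 / 27)
    # take the larger of -q/2 +- sqrt(...) so that u is not close to zero
    r1 = -q / 2 + root_term
    if abs(r1) < abs(-q / 2 - root_term):
        r1 = -q / 2 - root_term
    if r1 == 0:                                  # happens only when p = q = 0
        return [0j, 0j, 0j]
    u = r1 ** (1 / 3)                            # one value of the cube root
    v = -p / (3 * u)                             # enforces u*v = -p/3
    return [u + v, eps * u + eps ** 2 * v, eps ** 2 * u + eps * v]

for x in cardan_roots(-4, -1):
    print(x, abs(x ** 3 - 4 * x - 1))            # residuals are of rounding size
```

The roots are returned as complex numbers; for real p and q with three real roots the imaginary parts are zero up to rounding.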


192. The trigonometric form of solution of cubic equations. We assume 
that the coefficients p and q in equation (14) are real. It has already been 
mentioned that Cardan's formula is inconvenient for calculating the roots, 
and we now deduce a more practical formula. We consider four separate 
cases. 

1. q^2/4 + p^3/27 < 0. 

It follows from this that p < 0. The expressions R_1 and R_2 under the 
radicals in (20) will be imaginary, yet in spite of this, all three roots of the 
equation will be real, as we know [191]. 

We put: 

$$-\frac{q}{2} \pm i \sqrt{-\frac{q^2}{4} - \frac{p^3}{27}} = r(\cos\varphi \pm i \sin\varphi),$$

whence [171]: 

$$r = \sqrt{-\frac{p^3}{27}}; \qquad \cos\varphi = -\frac{q}{2r}. \qquad (22)$$

We have by Cardan's formula: 

$$x = \sqrt[3]{r} \left( \cos\frac{\varphi + 2k\pi}{3} + i \sin\frac{\varphi + 2k\pi}{3} \right) + \sqrt[3]{r} \left( \cos\frac{\varphi + 2k\pi}{3} - i \sin\frac{\varphi + 2k\pi}{3} \right) \qquad (k = 0, 1, 2).$$

If we take the same values of k in both terms, we obtain for the products 
of these terms the positive number: ∛(r^2) = -p/3. 
We have finally: 

$$x = 2 \sqrt[3]{r} \cos\frac{\varphi + 2k\pi}{3} \qquad (k = 0, 1, 2), \qquad (23)$$

where r and φ are defined by (22), and where it can easily be shown that, if 
we take different φ satisfying the second of equations (22), we get the same 
set of roots from (23). 
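
Formulae (22) and (23) can be sketched with ordinary real arithmetic. The function name `cubic_three_real_roots` is ours; the trial equation y^3 - 4y - 1 = 0 is chosen only because it satisfies the condition of this case.

```python
import math

def cubic_three_real_roots(p, q):
    """Roots of x^3 + p*x + q = 0 when q^2/4 + p^3/27 < 0, by (22) and (23)."""
    if q * q / 4 + p ** 3 / 27 >= 0:
        raise ValueError("formula (23) needs q^2/4 + p^3/27 < 0")
    r = math.sqrt(-p ** 3 / 27)
    phi = math.acos(-q / (2 * r))
    amplitude = 2 * math.sqrt(-p / 3)            # equals 2 * r**(1/3)
    return [amplitude * math.cos((phi + 2 * k * math.pi) / 3) for k in range(3)]

# y^3 - 4y - 1 = 0
print([round(y, 4) for y in cubic_three_real_roots(-4, -1)])
# -> [2.1149, -1.8608, -0.2541]
```

The inverse cosine takes the place of the logarithmic tables; any other angle with the same cosine would lead to the same three roots in a different order.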


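2. q^2/4 + p^3/27 > 0 and p < 0. 

Equation (14) has one real, and two conjugate complex, roots [191], 
whilst it follows from the conditions written that -(p^3/27) < (q^2/4). We 
introduce the auxiliary angle ω, putting: 

$$\sqrt{-\frac{p^3}{27}} = \frac{q}{2} \sin\omega. \qquad (24_1)$$

This gives us: 

$$\sqrt[3]{-\frac{q}{2} + \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}} = \sqrt[3]{-\frac{q}{2} + \frac{q}{2} \cos\omega} = -\sqrt{-\frac{p}{3}}\, \sqrt[3]{\tan\frac{\omega}{2}},$$
$$\sqrt[3]{-\frac{q}{2} - \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}} = \sqrt[3]{-\frac{q}{2} - \frac{q}{2} \cos\omega} = -\sqrt{-\frac{p}{3}}\, \sqrt[3]{\cot\frac{\omega}{2}},$$

since, by (24_1): 

$$\sqrt[3]{\frac{q}{2}} = \frac{\sqrt{-p/3}}{\sqrt[3]{\sin\omega}}.$$

We finally introduce the angle φ, where 

$$\tan\varphi = \sqrt[3]{\tan\frac{\omega}{2}}, \qquad (24_2)$$

and get the following expression for the real root: 

$$x = -\sqrt{-\frac{p}{3}}\, (\tan\varphi + \cot\varphi) = -\frac{2 \sqrt{-p/3}}{\sin 2\varphi}. \qquad (25_1)$$

We suggest that the reader make use of (21) to show that the imaginary 
roots will have the form: 

$$\frac{\sqrt{-p/3}}{\sin 2\varphi} \pm i \sqrt{-p}\, \cot 2\varphi. \qquad (25_2)$$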
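3. q^2/4 + p^3/27 > 0 and p > 0. 

In this, as in the previous case, (14) has one real and two conjugate 
complex roots. The value here of √(p^3/27) can be both greater and less than 
|q/2|, and we take an angle ω defined by the following, instead of by (24_1): 

$$\sqrt{\frac{p^3}{27}} = \frac{q}{2} \tan\omega. \qquad (26_1)$$

This gives: 

$$\sqrt[3]{-\frac{q}{2} + \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}} = \sqrt[3]{-\frac{q}{2} + \frac{q}{2 \cos\omega}} = \sqrt{\frac{p}{3}}\, \sqrt[3]{\tan\frac{\omega}{2}},$$
$$\sqrt[3]{-\frac{q}{2} - \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}} = \sqrt[3]{-\frac{q}{2} - \frac{q}{2 \cos\omega}} = -\sqrt{\frac{p}{3}}\, \sqrt[3]{\cot\frac{\omega}{2}}.$$

We introduce a new angle φ defined by: 

$$\tan\varphi = \sqrt[3]{\tan\frac{\omega}{2}}, \qquad (26_2)$$

and finally have: 

$$x = \sqrt{\frac{p}{3}}\, (\tan\varphi - \cot\varphi) = -2 \sqrt{\frac{p}{3}}\, \cot 2\varphi. \qquad (27_1)$$

The imaginary roots will be: 

$$\sqrt{\frac{p}{3}}\, \cot 2\varphi \pm i\, \frac{\sqrt{p}}{\sin 2\varphi}. \qquad (27_2)$$

4. q^2/4 + p^3/27 = 0. 

Equation (14) has a repeated root, and here, as in the case of p = 0, 
the solution presents no difficulty. 

The roots of a cubic equation can be calculated with considerable accuracy 
by using the trigonometric formulae deduced, together with logarithmic 
tables. 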


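Examples. 1. 

$$x^3 + 9x^2 + 23x + 14 = 0.$$

We put x = y - 3 and reduce the equation to the form: 

$$y^3 - 4y - 1 = 0,$$

which has three real roots [191]. 

We obtain cos φ from (22), and having found φ itself, we determine the 
roots from (23): 

$$\cos\varphi = \frac{\sqrt{27}}{16}; \quad \log_{10} \cos\varphi = \bar{1}.51156; \quad \varphi = 71^\circ 2' 56''$$
$$\frac{\varphi}{3} = 23^\circ 40' 59''; \quad \frac{\varphi + 2\pi}{3} = 143^\circ 40' 59''; \quad \frac{\varphi + 4\pi}{3} = 263^\circ 40' 59''$$
$$\log_{10} 2\sqrt[3]{r} = 0.36350$$
$$\log_{10} y_1 = 0.32529; \quad \log_{10} (-y_2) = 0.26970; \quad \log_{10} (-y_3) = \bar{1}.40501$$
$$y_1 = 2.1149; \quad y_2 = -1.8608; \quad y_3 = -0.2541$$
$$x_1 = -0.8851; \quad x_2 = -4.8608; \quad x_3 = -3.2541$$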
2. 

$$x^3 - 3x + 5 = 0.$$

We determine ω from (24_1) and φ from (24_2), then calculate the roots from 
(25_1) and (25_2): 

$$\log_{10} \sin\omega = \bar{1}.60206; \quad \omega = 23^\circ 34' 41''; \quad \tfrac{1}{2}\omega = 11^\circ 47' 20''$$
$$\log_{10} \tan\varphi = \bar{1}.77009; \quad \varphi = 30^\circ 29' 47''; \quad 2\varphi = 60^\circ 59' 34''$$
$$\log_{10} \frac{1}{\sin 2\varphi} = 0.05821; \quad \frac{1}{\sin 2\varphi} = 1.1434$$
$$\log_{10} \sqrt{-p}\, \cot 2\varphi = \bar{1}.98244; \quad \sqrt{-p}\, \cot 2\varphi = 0.96037$$
$$x_1 = -2.2868; \quad x_2, x_3 = 1.1434 \pm 0.96037\, i$$


193. The method of successive approximations. It is possible in many 
cases conveniently to improve upon an approximate value x_0 of a required 
root ξ, when x_0 is known to a certain number of decimal places. One such 
method of correcting an approximate value of a root is that of successive 
approximations. As is explained later, this method is suitable for 
transcendental, as well as algebraic, equations. 

We suppose that the equation 

$$f(x) = 0 \qquad (28)$$

has been rewritten in the form: 

$$f_1(x) = f_2(x), \qquad (29)$$

where f_1(x) is such that the equation 

$$f_1(x) = m$$

has one real root, easily calculable with great accuracy, for any real m. 
Evaluation of the roots of equation (29) by the method of successive 
approximations proceeds as follows: we substitute an approximate value x_0 
of the required root in the right-hand side of (29) and find a second 
approximation x_1 to the root from the equation: 

$$f_1(x) = f_2(x_0).$$

[Fig. 183]  [Fig. 184] 

We substitute x_1 in the right-hand side of (29) and get the next 
approximation x_2 by solving the equation f_1(x) = f_2(x_1), and so on. We thus find a 
sequence of values: 

$$x_0, x_1, x_2, \dots, x_n, \dots \qquad (30)$$

with 

$$f_1(x_1) = f_2(x_0); \quad f_1(x_2) = f_2(x_1); \quad \dots; \quad f_1(x_n) = f_2(x_{n-1}); \quad \dots \qquad (31)$$

The geometrical significance of the approximations obtained is easily 
shown. The required root is the abscissa of the point of intersection of the 
curves: 

$$y = f_1(x) \qquad (32_1)$$

and 

$$y = f_2(x). \qquad (32_2)$$

The curves are illustrated in Figs. 183 and 184, the derivatives f_1'(x) and 
f_2'(x) having the same sign at the point of intersection in the case of Fig. 
183, and different signs in Fig. 184, whilst in both cases 

$$|f_2'(\xi)| < |f_1'(\xi)|.$$

The following construction corresponds to equations (31): we draw the 
straight line x = x_0, parallel to OY, to its point of intersection (x_0, y_0) with 
the curve (32_2); we draw through this point the line y = y_0, parallel to OX, 
to its point of intersection (x_1, y_0) with curve (32_1); now we draw through 
(x_1, y_0) the line x = x_1, parallel to OY, to its point of intersection (x_1, y_1) 
with the curve (32_2); next we draw through this last point the line y = y_1 
to its intersection at (x_2, y_1) with the curve (32_1), and so on. The abscissae 
of the points of intersection give us the sequence (30). 

If the first approximation is taken sufficiently close to ξ, this sequence 
tends to ξ as a limit, as is clear from the figure; in the case where f_1'(ξ) and 
f_2'(ξ) have the same sign, a step-line tending to ξ is obtained (Fig. 183) 
whereas when f_1'(ξ) and f_2'(ξ) have different signs, we get a form of rectangular 
spiral tending to ξ (Fig. 184). We shall not go into the condition for, and 
rigorous proof of, the fact that sequence (30) tends to ξ as a limit. This can be 
observed directly from the figure in many cases. 

The method described is particularly convenient to apply in the case 
when equation (29) has the form: 

$$x = f_2(x).$$

Let ξ be the root of this equation, and let an approximate value of it, 
x_0, lying in the interval (ξ - h, ξ + h), 
be known. 
The sequence of approximations will be: 

$$x_1 = f_2(x_0); \quad x_2 = f_2(x_1); \quad \dots; \quad x_n = f_2(x_{n-1}); \quad \dots$$

It can be shown that in fact x_n → ξ as n → ∞, if the function f_2(x) has 
a derivative f_2'(x) which satisfies the condition: 

$$|f_2'(x)| < q < 1,$$

when 

$$\xi - h < x < \xi + h.$$

Examples. 1. We take the equation 

$$x^5 - x - 0.2 = 0. \qquad (33)$$

Its real roots are the points of intersection of (Fig. 185): 

$$y = x^5, \qquad (34_1)$$
$$y = x + 0.2 \qquad (34_2)$$

and (33) has one positive and two negative roots, as can be seen from 
Fig. 185. 

[Fig. 185] 

At the points of intersection A and B, corresponding respectively to the 
positive root and to the negative root with the greater absolute value, the 
absolute value of the slope of the straight line (34_2) is less than the absolute 
value of the slope of the tangent to curve (34_1), i.e. calculation of these 
roots by the method of successive approximations must proceed with 
equation (33) set in the form: 

$$x^5 = x + 0.2.$$

We take x_0 = 1 as a first approximation for calculating the positive root, 
and obtain the table: 

  x_n = (x_{n-1} + 0.2)^{1/5}     x_n + 0.2 
  x_0 = 1                         1.2 
  x_1 = 1.037                     1.237 
  x_2 = 1.0434                    1.2434 
  x_3 = 1.0445                    1.2445 
  x_4 = 1.04472 

With x_4, the required root is obtained 
to an accuracy of four decimal places. 

We evaluate the negative root with the greater absolute value by taking 
x_0 = -1 as a first approximation. 

  x_n = (x_{n-1} + 0.2)^{1/5}     x_n + 0.2 
  x_0 = -1                        -0.8 
  x_1 = -0.956                    -0.756 
  x_2 = -0.9456                   -0.7456 
  x_3 = -0.9430                   -0.7430 
  x_4 = -0.9423                   -0.7423 
  x_5 = -0.94214                  -0.74214 
  x_6 = -0.94210 

The error in this case does not exceed 2 × 10^{-5}. 

At C, corresponding to the negative root with the smaller absolute value, 
the absolute value of the slope of the tangent to the curve (34_1) is less 
than unity, so that use of the method of successive approximations implies 
setting (33) in the form: 

$$x = x^5 - 0.2.$$

We take x_0 = 0 as a first approximation, and obtain: 

  x_n = x_{n-1}^5 - 0.2           x_n^5 
  x_0 = 0                         0 
  x_1 = -0.2                      -0.00032 
  x_2 = -0.20032 

The approximation x_2 gives the root to an accuracy of five places. The 
approximation to the root proceeds by a step-line in all three cases, as 
illustrated in Fig. 183, and as is easily verified from Fig. 185; also, x_n tends 
monotonically to the required root with increase of n in all three cases. 

2. 

$$x = \tan x. \qquad (35)$$

The roots of this equation are the points of intersection (Fig. 186) of 

$$y = x, \qquad y = \tan x,$$

[Fig. 186] 

and, as can be seen from the figure, the equation has one root in each of the 
intervals 

$$\left( (2n - 1)\frac{\pi}{2},\ (2n + 1)\frac{\pi}{2} \right) \qquad (n = 0, \pm 1, \pm 2, \dots).$$

The following approximation will hold for positive roots: 

$$\alpha_n \approx (2n + 1)\frac{\pi}{2},$$

where α_n denotes the nth positive root of equation (35). 
We evaluate the root α_1, close to 3π/2. We apply the method of successive 
approximations after rewriting (35) in the form: 

$$x = \arctan x,$$

and we take x_0 = 3π/2 as the first approximation. 
When calculating the sequence of approximations 

$$x_n = \arctan x_{n-1},$$

we must always take the value of the arc tangent lying in the third quadrant. 
We obtain, using logarithmic tables and expressing the arcs in radian measure: 

$$x_0 = 4.7124; \quad x_1 = 4.5033; \quad x_2 = 4.4938; \quad x_3 = 4.4935.$$

194. Newton’s method. The process of successive approximations shown in 
Figs. 183 and 184 consists in approximating to the required root by means 
of lines parallel to the coordinate axes. We now give some other similar 
processes, in which use is made of straight lines inclined to the axes. 
Newton’s method is one of these. 

Let $x_0'$ and $x_0$ be approximate values of the root $\xi$ of the equation

$$f(x) = 0 \tag{36}$$

and let $\xi$ be the only root of the equation in the interval $(x_0', x_0)$.
Graphs of

$$y = f(x)$$

are illustrated in Figs. 187 and 188.

The abscissae of points $N$ and $P$ are the approximations $x_0'$ and $x_0$ to
the root $\xi$, which is given by the abscissa of $A$. The tangent $PQ_1$ to the
curve is drawn at the point $P$, and from the point of intersection $Q_1$ of this
tangent with the axis of abscissae we draw the ordinate $Q_1Q$ of the curve; at
the point $Q$ we draw the tangent $QR_1$ to the curve and from the point $R_1$
draw the ordinate $R_1R$ of the curve and so on.

It is clear from the figure that the points $P, Q, R, \ldots$ tend to $A$, so that
their abscissae $x_0, x_1, x_2, \ldots$ form a sequence of approximations to $\xi$.
We deduce a formula expressing $x_n$ in terms of $x_{n-1}$.

The equation of the tangent $PQ_1$ is:

$$Y - f(x_0) = f'(x_0)(X - x_0).$$

We find the abscissa of $Q_1$ by substituting $Y = 0$:

$$x_1 = x_0 - \frac{f(x_0)}{f'(x_0)}, \tag{37}$$

and in general:

$$x_n = x_{n-1} - \frac{f(x_{n-1})}{f'(x_{n-1})} \qquad (n = 1, 2, 3, \ldots). \tag{38}$$


We have simply observed by inspection of the figures that the $x_n$ are
approximations to the root $\xi$, the figures being drawn for the case when
$f(x)$ is monotonic and remains convex (or concave) in the interval $(x_0', x_0)$,
in other words, when $f'(x)$ and $f''(x)$ preserve their sign in the interval
$[x_0', x_0]$. We shall not dwell on a rigorous analytic proof.

Fig. 187        Fig. 188

We remark that if we applied Newton's method to the end $x_0'$ instead of
to $x_0$, approximations to the root would not be obtained, as is indicated by
the tangent drawn with a broken line. In the case of Fig. 188, the curve
becomes concave in the region of positive ordinates, i.e. $f''(x) > 0$, and
Newton's method has to be applied at the end where $f(x) > 0$, as we have
seen. It follows from Fig. 187 that, with $f''(x) < 0$, Newton's method must
be applied to the end where the ordinate $f(x) < 0$. We thus arrive at the
following rule: if $f'(x)$ and $f''(x)$ do not vanish in the interval $(x_0', x_0)$,
whilst the ordinates $f(x_0')$ and $f(x_0)$ have different signs, application of
Newton's method to the end of the interval at which $f(x)$ and $f''(x)$ have the
same sign gives us a sequence of approximations to the unique root of
equation (36) contained in the interval.
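
The rule can be tried out numerically. The sketch below applies Newton's iteration $x_n = x_{n-1} - f(x_{n-1})/f'(x_{n-1})$ from the end at which $f$ and $f''$ have the same sign. The function name, the test equation $x^5 - x - 0.2 = 0$ on $(1, 1.1)$ and the number of steps are illustrative choices, and the derivatives are supplied by hand.

```python
def newton_from_matching_end(f, df, d2f, left, right, steps):
    """Newton iterates started at the end where f and f'' share a sign."""
    start = right if f(right) * d2f(right) > 0 else left
    xs = [start]
    for _ in range(steps):
        x = xs[-1]
        xs.append(x - f(x) / df(x))
    return xs


def f(x):
    return x**5 - x - 0.2


def df(x):
    return 5 * x**4 - 1


def d2f(x):
    return 20 * x**3


iterates = newton_from_matching_end(f, df, d2f, 1.0, 1.1, 5)
```

Here $f(1.1) > 0$ and $f''(1.1) > 0$, so the iteration starts from the right-hand end and the first iterates decrease towards the root.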


195. The method of simple interpolation. We mention one further method
of approximate evaluation of roots. We draw a straight line through the
ends $N$, $P$ of the arc of the curve. The abscissa of the intersection $B$ of
this line with axis $OX$ gives an approximate value of the root (Fig. 189). As
before, let $x_0'$, $x_0$ be the abscissae of the ends of the interval. The
equation of $NP$ is:

$$\frac{Y - f(x_0)}{f(x_0') - f(x_0)} = \frac{X - x_0}{x_0' - x_0}.$$

We find the expression for the abscissa of $B$ by putting $Y = 0$:

$$\frac{x_0' f(x_0) - x_0 f(x_0')}{f(x_0) - f(x_0')},$$

or

$$x_0 - \frac{(x_0 - x_0')\, f(x_0)}{f(x_0) - f(x_0')}, \tag{39}$$

or

$$x_0' - \frac{(x_0 - x_0')\, f(x_0')}{f(x_0) - f(x_0')}.$$

Replacing the segment of curve by the straight line passing through its
ends is equivalent to replacing the function $f(x)$ in the interval by a first
degree polynomial having the same end values as $f(x)$; or alternatively,
and this amounts to the same thing, it is equivalent to assuming that the
variation of $f(x)$ in the interval is proportional to the variation of $x$. This
method is usually referred to as simple interpolation, and is applied, for
instance, when using logarithmic tables (proportional parts). It is also
sometimes called the rule of false position (regula falsi).

If simple interpolation is used simultaneously with Newton's method
the possibility arises of approximating both limits $x_0'$ and $x_0$ to the
root $\xi$.

Fig. 189        Fig. 190

Suppose, for example, that $f(x)$ and $f''(x)$ have the same sign at the end
$x_0$, so that Newton's method has to be used with reference to this end.
The use of both methods gives us two new approximate values (Fig. 190):


$$x_1' = \frac{x_0' f(x_0) - x_0 f(x_0')}{f(x_0) - f(x_0')}; \qquad x_1 = x_0 - \frac{f(x_0)}{f'(x_0)}.$$

We can again apply the same formulae to the approximate values $x_1'$
and $x_1$, and obtain the new values:

$$x_2' = \frac{x_1' f(x_1) - x_1 f(x_1')}{f(x_1) - f(x_1')}; \qquad x_2 = x_1 - \frac{f(x_1)}{f'(x_1)}.$$

We thus obtain two series of values:

$$x_0', x_1', x_2', \ldots, x_n', \ldots$$

and

$$x_0, x_1, x_2, \ldots, x_n, \ldots,$$

approximating to the root $\xi$ from the left and from the right.
If $x_n'$ and $x_n$ coincide to several decimal places, the root $\xi$, lying
between $x_n'$ and $x_n$, must be obtained to the same number of decimal places.
Example. The equation

$$f(x) = x^5 - x - 0.2 = 0,$$

which we considered in Example 1 of [193], has one positive root in the
interval $1 < x < 1.1$, and

$$f'(x) = 5x^4 - 1 \quad \text{and} \quad f''(x) = 20x^3$$

do not change sign in this interval. We can therefore put

$$x_0' = 1; \qquad x_0 = 1.1.$$

We calculate the values of $f(x)$:

$$f(1) = -0.2; \qquad f(1.1) = 0.31051,$$

from which it is clear that $f(x)$ and $f''(x)$ have the same (plus) sign at the
right-hand end ($x_0 = 1.1$), so that it is in regard to this end that Newton's
method must be applied.
We firstly evaluate $f'(x)$ at the right-hand end:

$$f'(1.1) = 6.3205.$$

We have by (37) and (39):

$$x_1' = 1 + \frac{0.1 \times 0.2}{0.51051} = 1.039,$$

$$x_1 = 1.1 - \frac{0.31051}{6.3205} = 1.051.$$

We evaluate for the next approximation:

$$f(1.039) = -0.0282; \qquad f(1.051) = 0.0313; \qquad f'(1.051) = 5.1005,$$

whence:

$$x_2' = 1.039 + \frac{0.012 \times 0.0282}{0.0595} = 1.04469,$$

$$x_2 = 1.051 - \frac{0.0313}{5.1005} = 1.04487,$$

which gives the value of the root to an accuracy of five places within two
units [193]:

$$1.04469 < \xi < 1.04487.$$
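
The two-sided process of this example can be sketched as follows. Each step moves the left-hand end by simple interpolation and the right-hand end by Newton's formula. The name `combined_step` is introduced here, and the code keeps full precision instead of rounding to three decimals after the first step, so its second pair of values differs slightly from the hand computation.

```python
def f(x):
    return x**5 - x - 0.2


def df(x):
    return 5 * x**4 - 1


def combined_step(lo, hi):
    """One step: the chord moves the left end, the tangent moves the right end."""
    new_lo = lo - (hi - lo) * f(lo) / (f(hi) - f(lo))   # simple interpolation
    new_hi = hi - f(hi) / df(hi)                        # Newton's formula
    return new_lo, new_hi


lo, hi = 1.0, 1.1
history = [(lo, hi)]
for _ in range(2):
    lo, hi = combined_step(lo, hi)
    history.append((lo, hi))
```

After each step the root remains enclosed between the two values, so their common leading digits are digits of the root.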


§ 19. Integration of various functions 


196. Reduction of rational fractions to partial fractions. We described 
above a number of methods of evaluating indefinite integrals. We 
supplement this description in the present section and give it a more 
systematic character. The first problem is that of integrating rational 
fractions, i.e. the quotients of two polynomials. Before we turn to 
the solution of this problem, we establish a formula which represents 
a rational fraction as the sum of several fractions of the simplest type. 
This is known as the reduction of a rational fraction to partial fractions. 

Let us be given the rational fraction:

$$\frac{F(x)}{f(x)}.$$

If it is an improper fraction, i.e. the degree of the numerator is
not lower than that of the denominator, we can divide out an integral
part consisting of the polynomial $Q(x)$ and write the fraction in the
form:

$$\frac{F(x)}{f(x)} = Q(x) + \frac{\varphi(x)}{f(x)}, \tag{1}$$

where $\varphi(x)/f(x)$ is now a proper fraction, the numerator of which is of
lower degree than the denominator. We shall further assume that this latter
fraction is irreducible, i.e. that the numerator and denominator are
relatively prime [188].

Let $x = a$ be a zero of order $k$ of the denominator:

$$f(x) = (x - a)^k f_1(x) \quad \text{and} \quad f_1(a) \neq 0.$$

We show that the fraction can be written in the form of a sum:

$$\frac{\varphi(x)}{(x-a)^k f_1(x)} = \frac{A}{(x-a)^k} + \frac{\varphi_1(x)}{(x-a)^{k-1} f_1(x)}, \tag{2}$$

where $A$ is a constant and the second term on the right-hand side is
a proper fraction.
We form the difference:

$$\frac{\varphi(x)}{(x-a)^k f_1(x)} - \frac{A}{(x-a)^k} = \frac{\varphi(x) - A f_1(x)}{(x-a)^k f_1(x)}$$

and define $A$ in such a way that the numerator of the fraction on the
right-hand side of the equation written is divisible by $(x - a)$ [184]:

$$\varphi(a) - A f_1(a) = 0,$$

whence

$$A = \frac{\varphi(a)}{f_1(a)}.$$

We can cancel $(x - a)$ in the right-hand side just mentioned with
this choice of $A$, and we thus arrive at identity (2). This shows that
separation of the term of the form $A/(x - a)^k$, which is known as a
partial fraction, enables us to lower the power of the factor $(x - a)^k$
appearing in the denominator by at least unity.
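
The choice $A = \varphi(a)/f_1(a)$ is easy to evaluate exactly. The sketch below does so with rational arithmetic; the helper names and the sample fraction $1/(x^3 + 1)$, for which $a = -1$ is a simple zero and $f_1(x) = x^2 - x + 1$, are illustrative.

```python
from fractions import Fraction


def poly_eval(coeffs, x):
    """Horner evaluation; coeffs are listed from the highest power down."""
    value = Fraction(0)
    for c in coeffs:
        value = value * x + c
    return value


def leading_partial_fraction_coefficient(phi, f1, a):
    """A = phi(a) / f1(a), valid when f1(a) != 0."""
    return poly_eval(phi, a) / poly_eval(f1, a)


# phi(x) = 1, f1(x) = x^2 - x + 1, zero a = -1 of x^3 + 1.
A = leading_partial_fraction_coefficient([1], [1, -1, 1], Fraction(-1))
```

For this fraction $f_1(-1) = 3$, so that $A = 1/3$.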

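Suppose that the denominator can be written in the factor form:

$$f(x) = (x - a_1)^{k_1} (x - a_2)^{k_2} \ldots (x - a_m)^{k_m}.$$

We do not write a constant factor, since this can be divided into
the numerator. Repeated application of the above rule for separating
out the partial fraction gives us the reduction of a proper rational
fraction to partial fractions:

$$\begin{aligned}
\frac{\varphi(x)}{f(x)} ={}& \frac{A^{(1)}_{k_1}}{(x-a_1)^{k_1}} + \frac{A^{(1)}_{k_1-1}}{(x-a_1)^{k_1-1}} + \ldots + \frac{A^{(1)}_{1}}{x-a_1} \\
&+ \frac{A^{(2)}_{k_2}}{(x-a_2)^{k_2}} + \frac{A^{(2)}_{k_2-1}}{(x-a_2)^{k_2-1}} + \ldots + \frac{A^{(2)}_{1}}{x-a_2} \\
&+ \ldots \\
&+ \frac{A^{(m)}_{k_m}}{(x-a_m)^{k_m}} + \frac{A^{(m)}_{k_m-1}}{(x-a_m)^{k_m-1}} + \ldots + \frac{A^{(m)}_{1}}{x-a_m}.
\end{aligned} \tag{3}$$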

We now give some methods of determining the coefficients appearing
on the right-hand side of the identity written. If we clear this of the
denominators, we arrive at an identity between two polynomials,
and on equating corresponding coefficients, we get a system of linear
equations defining the required coefficients. As mentioned above [185],
we call this the method of undetermined coefficients.

A different approach can be used, in which we assign particular
values to the variable $x$ in the identity between the two polynomials
mentioned above. This substitution method can also be used as a
preliminary to repeated differentiation of the identity (3).

We shall not dwell on the proof of the fact that expansion (3) is
unique, i.e. its coefficients have fully defined values, independent of
the method of expansion. We give examples later of using these
methods for finding the unknown coefficients of the expansion.

Even when the polynomials $\varphi(x)$ and $f(x)$ are real, the right-hand
side of (3) may still contain imaginary terms, resulting from imaginary
zeros of the denominator. We shall mention another expansion of a
rational fraction which is free from this defect, though here we
confine ourselves to the case when the denominator of the fraction
has only simple zeros, since this case is of the most value in applications.

The sum of partial fractions corresponding to a pair of conjugate
complex zeros of the denominator, $x = a \pm bi$, is

$$\frac{A + Bi}{x - a - bi} + \frac{A - Bi}{x - a + bi}.$$

If we reduce these fractions to a common denominator, we get
a partial fraction of the form:

$$\frac{Mx + N}{x^2 + px + q}.$$

We can thus reduce the real rational fraction in this case to real
partial fractions:

$$\begin{aligned}
\frac{\varphi(x)}{f(x)} ={}& \frac{A_1}{x - a_1} + \frac{A_2}{x - a_2} + \ldots + \frac{A_r}{x - a_r} \\
&+ \frac{M_1 x + N_1}{x^2 + p_1 x + q_1} + \frac{M_2 x + N_2}{x^2 + p_2 x + q_2} + \ldots + \frac{M_s x + N_s}{x^2 + p_s x + q_s},
\end{aligned} \tag{4}$$

where the fractions in the first line correspond to real zeros of the
denominator, and those in the second line to pairs of conjugate
complex roots.


197. Integration of rational fractions. Integration of a rational
fraction leads by (1) to integration of a polynomial, which gives
another polynomial, and to integration of a proper rational fraction,
which we now consider.
If the denominator only has simple zeros, the problem reduces,
by (4), to the two types of integral:

$$1. \quad \int \frac{A}{x - a}\,dx = A \log(x - a) + C$$

and

$$2. \quad \int \frac{Mx + N}{x^2 + px + q}\,dx.$$

Recalling what was said in [92], we get a result of the form:

$$\int \frac{Mx + N}{x^2 + px + q}\,dx = \lambda \log(x^2 + px + q) + \mu \arctan\frac{2x + p}{\sqrt{4q - p^2}} + C.$$

The integral is expressed here in terms of a logarithm and an arc
tangent.

We now take the case when the denominator of the proper rational 
fraction contains multiple zeros. We return to expansion (3). Any 
imaginary numbers that may appear in it will only play an inter- 
mediary role in the subsequent working and will vanish in the final 
result. 

Integration of a partial fraction in which the denominator has a
degree higher than the first yields another rational fraction:

$$\int \frac{A^{(i)}_{s}}{(x - a_i)^{s}}\,dx = \frac{A^{(i)}_{s}}{(1 - s)(x - a_i)^{s-1}} + C \qquad (s > 1).$$

The sum of the fractions obtained after integration gives the
algebraic part of the integral, which, on reduction to a common
denominator, is clearly a proper fraction of the type:

$$\frac{\omega(x)}{(x - a_1)^{k_1 - 1} (x - a_2)^{k_2 - 1} \ldots (x - a_m)^{k_m - 1}}.$$

The numerator $\omega(x)$ is a polynomial of degree lower by at least
unity than the degree of the denominator, whilst the denominator
consists of the highest common factor $D(x)$ of the denominator $f(x)$
of the integrand and its first derivative $f'(x)$ [188].

The sum of the fractions not yet integrated:

$$\frac{A^{(1)}_{1}}{x - a_1} + \frac{A^{(2)}_{1}}{x - a_2} + \ldots + \frac{A^{(m)}_{1}}{x - a_m}$$

gives on reduction to a common denominator a proper fraction of
the form:

$$\frac{\omega_1(x)}{(x - a_1)(x - a_2) \ldots (x - a_m)},$$

where $\omega_1(x)$ is a polynomial of degree lower by at least unity than the
degree of the denominator, whilst the denominator consists of the
quotient $D_1(x)$ of $f(x)$ divided by $D(x)$. We thus obtain the
Ostrogradskii-Hermite formula:

$$\int \frac{\varphi(x)}{f(x)}\,dx = \frac{\omega(x)}{D(x)} + \int \frac{\omega_1(x)}{D_1(x)}\,dx. \tag{5}$$

We can find $D(x)$ and $D_1(x)$ without knowing the zeros of $f(x)$ [188].
We now show how to determine the coefficients of the polynomials
$\omega(x)$ and $\omega_1(x)$, the degrees of which we can take as one less than the
degrees of the corresponding denominators. We get rid of the integral
signs by differentiating equation (5). We get rid of the denominators
in the identity thus obtained, and get an identity between two
polynomials; on applying to this latter the method of undetermined
coefficients or the substitution method, we can find the coefficients of
$\omega(x)$ and $\omega_1(x)$.

The Ostrogradskii-Hermite formula thus gives the algebraic part of
the integral of a proper rational fraction, even in the case when the
zeros of the denominator are unknown. The denominator of the
fraction under the integral sign on the right-hand side of equation (5)
contains only simple zeros, and we can evaluate the integral by
reducing to partial fractions, the result being expressed in terms of
logarithms and arc tangents, as we have seen. We have to know the zeros
of $D_1(x)$ for this last operation.











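Example. Using the Ostrogradskii-Hermite formula:

$$\int \frac{dx}{(x^3 + 1)^2} = \frac{\alpha x^2 + \beta x + \gamma}{x^3 + 1} + \int \frac{\delta x^2 + \varepsilon x + \eta}{x^3 + 1}\,dx.$$

We differentiate with respect to $x$:

$$\frac{1}{(x^3 + 1)^2} = \frac{(2\alpha x + \beta)(x^3 + 1) - 3x^2(\alpha x^2 + \beta x + \gamma)}{(x^3 + 1)^2} + \frac{\delta x^2 + \varepsilon x + \eta}{x^3 + 1}$$

and obtain, on clearing fractions:

$$1 = (2\alpha x + \beta)(x^3 + 1) - 3x^2(\alpha x^2 + \beta x + \gamma) + (\delta x^2 + \varepsilon x + \eta)(x^3 + 1).$$

We compare coefficients of $x^5$ and find $\delta = 0$, then compare coefficients
of $x^2$ and find $\gamma = 0$. We substitute $\gamma = \delta = 0$ in the identity written and
compare coefficients of the remaining powers, and find:

$$\varepsilon - \alpha = 0; \quad \eta - 2\beta = 0; \quad 2\alpha + \varepsilon = 0; \quad \beta + \eta = 1,$$

whence finally:

$$\alpha = \gamma = \delta = \varepsilon = 0; \quad \beta = \frac{1}{3}; \quad \eta = \frac{2}{3},$$

so that

$$\int \frac{dx}{(x^3 + 1)^2} = \frac{x}{3(x^3 + 1)} + \frac{2}{3} \int \frac{dx}{x^3 + 1}.$$

The last integral is evaluated by reducing to partial fractions:

$$\frac{1}{x^3 + 1} = \frac{A}{x + 1} + \frac{Mx + N}{x^2 - x + 1}.$$

We clear fractions:

$$1 = A(x^2 - x + 1) + (Mx + N)(x + 1).$$

Substituting $x = -1$ gives $A = 1/3$, then comparing coefficients of $x^2$
and the absolute terms gives:

$$M = -\frac{1}{3}; \quad N = \frac{2}{3},$$

and consequently,

$$\frac{1}{x^3 + 1} = \frac{1}{3(x + 1)} - \frac{x - 2}{3(x^2 - x + 1)}.$$

We finally get:

$$\int \frac{dx}{x^3 + 1} = \frac{1}{3} \log(x + 1) - \frac{1}{6} \log(x^2 - x + 1) + \frac{1}{\sqrt{3}} \arctan\frac{2x - 1}{\sqrt{3}} + C,$$

whence

$$\int \frac{dx}{(x^3 + 1)^2} = \frac{x}{3(x^3 + 1)} + \frac{1}{9} \log\frac{(x + 1)^2}{x^2 - x + 1} + \frac{2}{3\sqrt{3}} \arctan\frac{2x - 1}{\sqrt{3}} + C.$$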

198. Integration of expressions containing radicals. We consider 
some other types of integral, which reduce to integrals of rational 
fractions. 

1. The integral

$$\int R\left[x, \left(\frac{ax + b}{cx + d}\right)^{\lambda}, \left(\frac{ax + b}{cx + d}\right)^{\mu}, \ldots\right] dx, \tag{6}$$

where $R$ is a rational function of its arguments, i.e. the quotient of
polynomials in these arguments, whilst $\lambda, \mu, \ldots$ are rational numbers.

Let $m$ be the common denominator of these fractions. We introduce
a new variable $t$:

$$\frac{ax + b}{cx + d} = t^m.$$

Evidently, after this, $x$, $dx/dt$ and the expressions:

$$\left(\frac{ax + b}{cx + d}\right)^{\lambda}, \quad \left(\frac{ax + b}{cx + d}\right)^{\mu}, \ldots$$

will be rational functions of $t$, and (6) reduces to the integral of a
rational fraction.


2. Binomial differential. The integrals of binomial differentials:

$$\int x^m (a + bx^n)^p\,dx, \tag{7}$$

where $m$, $n$ and $p$ are rational numbers, reduce in many cases to the
integral (6).
We put $x = t^{1/n}$:

$$\int x^m (a + bx^n)^p\,dx = \frac{1}{n} \int t^{\frac{m+1}{n} - 1} (a + bt)^p\,dt.$$

If $p$ or $(m + 1)/n$ is an integer, the integral obtained is of the type (6).
It follows from the evident equality:

$$\int t^{\frac{m+1}{n} - 1} (a + bt)^p\,dt = \int t^{\frac{m+1}{n} + p - 1} \left(\frac{a + bt}{t}\right)^p dt$$

that in the case when $\frac{m+1}{n} + p$ is an integer, integral (7) reduces to
the form (6).

There is a theorem of Tchebysheff, according to which the above 
three cases exhaust all the cases in which the integral of a binomial 
differential is expressible in terms of elementary functions. 
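
The three cases are easily tested for given exponents. The sketch below returns the number of the first case that applies, or `None` when none does; the function name and the sample exponents are illustrative.

```python
from fractions import Fraction


def binomial_differential_case(m, n, p):
    """Which integrability case x^m (a + b x^n)^p falls under, if any."""
    m, n, p = Fraction(m), Fraction(n), Fraction(p)
    q = (m + 1) / n
    if p.denominator == 1:          # p is an integer
        return 1
    if q.denominator == 1:          # (m + 1)/n is an integer
        return 2
    if (q + p).denominator == 1:    # (m + 1)/n + p is an integer
        return 3
    return None


case_a = binomial_differential_case("1/2", "1/3", 2)    # p integer
case_b = binomial_differential_case(3, 2, "1/2")        # (m + 1)/n = 2
case_c = binomial_differential_case(0, 4, "-1/4")       # (m + 1)/n + p = 0
case_d = binomial_differential_case(0, 3, "1/2")        # none of the cases
```

The last sample corresponds to $\int \sqrt{a + bx^3}\,dx$, which by the theorem quoted is not expressible in terms of elementary functions.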


199. Integrals of the type $\int R(x, \sqrt{ax^2 + bx + c})\,dx$. An integral of
the form

$$\int R(x, \sqrt{ax^2 + bx + c})\,dx, \tag{8}$$

where $R$ is a rational function of its arguments, leads to the integral
of a rational fraction with the aid of Euler's substitution.
In the case $a > 0$, Euler's first substitution can be used:

$$\sqrt{ax^2 + bx + c} = t - x\sqrt{a}.$$

On squaring both sides of this equation and solving with respect
to $x$, we get:

$$x = \frac{t^2 - c}{2t\sqrt{a} + b},$$

from which it is clear that $x$, $dx/dt$, and $\sqrt{ax^2 + bx + c}$ are rational
functions of $t$, and therefore integral (8) leads to the integral of a rational
fraction.
In the case $c > 0$, we can use Euler's second substitution:

$$\sqrt{ax^2 + bx + c} = tx + \sqrt{c}.$$

We leave it to the reader to verify this.

In the case $a < 0$, the quadratic expression $(ax^2 + bx + c)$ must
have real zeros $x_1$ and $x_2$, since it would otherwise have the $(-)$ sign for
all real $x$, whilst $\sqrt{ax^2 + bx + c}$ would be imaginary. In the case of
real zeros integral (8) leads to integration of a rational fraction with
the aid of Euler's third substitution:

$$\sqrt{a(x - x_1)(x - x_2)} = t(x - x_2),$$

which we suggest may also be checked by the reader.
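
The first two substitutions can be checked numerically. In the sketch below the expression for $x$ in the first substitution is the one given above; the expression $x = (2t\sqrt{c} - b)/(a - t^2)$ for the second is worked out here by squaring $\sqrt{ax^2 + bx + c} = tx + \sqrt{c}$ and is not quoted from the text. The coefficients and the values of $t$ are arbitrary.

```python
import math


def euler_first(a, b, c, t):
    """x(t) and the radical for sqrt(a x^2 + b x + c) = t - x sqrt(a), a > 0."""
    x = (t * t - c) / (2.0 * t * math.sqrt(a) + b)
    return x, t - x * math.sqrt(a)


def euler_second(a, b, c, t):
    """x(t) and the radical for sqrt(a x^2 + b x + c) = t x + sqrt(c), c > 0."""
    x = (2.0 * t * math.sqrt(c) - b) / (a - t * t)
    return x, t * x + math.sqrt(c)


a, b, c = 2.0, 3.0, 5.0          # arbitrary coefficients with a > 0, c > 0
worst = 0.0
for substitution in (euler_first, euler_second):
    for t in (1.0, 2.5, 4.0):
        x, radical = substitution(a, b, c, t)
        # Compare squares, so the check does not depend on the branch of the root.
        worst = max(worst, abs(radical**2 - (a * x * x + b * x + c)))
```

In both cases $x$ and the radical are rational in $t$, which is the property used to reduce integral (8) to the integral of a rational fraction.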
Euler's substitutions lead to complicated expressions for the most
part, and we therefore give a second method of evaluating integral (8).
For brevity, let

$$y = \sqrt{ax^2 + bx + c}.$$

Every positive even power of $y$ consists of a polynomial in $x$, and
the integrand is therefore easily reduced to the form:

$$R(x, y) = \frac{\omega_1(x) + \omega_2(x)\,y}{\omega_3(x) + \omega_4(x)\,y},$$

where the $\omega_s(x)$ are polynomials in $x$. If we get rid of the irrational in
the denominator and carry out elementary transformations, we can
put the expression written in the form:

$$R(x, y) = \frac{\omega_5(x)}{\omega_6(x)} + \frac{\omega_7(x)}{\omega_8(x)\,y}.$$

The first term is a rational fraction, which we know how to integrate.
On dividing out $\omega_7(x)/\omega_8(x)$ into an integral part and reducing the
remaining proper fraction to partial fractions we arrive at an integral
of the form:

$$\int \frac{\varphi(x)}{\sqrt{ax^2 + bx + c}}\,dx \tag{9}$$

and at another of the form

$$\int \frac{dx}{(x - a)^n \sqrt{ax^2 + bx + c}}, \tag{10}$$

where $\varphi(x)$ is a polynomial in $x$.
We assume here that polynomials $\omega_8(x)$ only have real zeros.
Before we consider integrals (9) and (10) in general, we note two
simple particular cases of integral (9):

$$\int \frac{dx}{\sqrt{ax^2 + bx + c}} = \frac{1}{\sqrt{a}} \log\left(ax + \frac{b}{2} + \sqrt{a}\,\sqrt{ax^2 + bx + c}\right) + C \qquad (a > 0), \tag{11}$$


$$\int \frac{dx}{\sqrt{-x^2 + bx + c}} = \arcsin\frac{2x - b}{\sqrt{b^2 + 4c}} + C. \tag{12}$$

Formula (11) is easily obtained with the aid of Euler's first
substitution, whilst we worked out (12) previously in [92].
Integral (9) can conveniently be worked out by using the formula:

$$\int \frac{\varphi(x)}{\sqrt{ax^2 + bx + c}}\,dx = \psi(x) \sqrt{ax^2 + bx + c} + \lambda \int \frac{dx}{\sqrt{ax^2 + bx + c}}, \tag{13}$$

where $\psi(x)$ is a polynomial whose degree is one less than that of $\varphi(x)$,
and $\lambda$ is a constant. We shall not dwell on the proof of (13). By
differentiating (13) and clearing fractions, we obtain an identity between
two polynomials, from which the coefficients of $\psi(x)$ and $\lambda$ can be
determined.
Integral (10) reduces to integral (9) with the aid of the substitution

$$x - a = \frac{1}{t}.$$
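Example.

$$\int \frac{dx}{x - \sqrt{x^2 - x + 1}} = \int \frac{x + \sqrt{x^2 - x + 1}}{x - 1}\,dx = \int \frac{x}{x - 1}\,dx + \int \frac{x^2 - x + 1}{(x - 1)\sqrt{x^2 - x + 1}}\,dx$$

$$= x + \log(x - 1) + \int \frac{x^2 - x + 1}{(x - 1)\sqrt{x^2 - x + 1}}\,dx.$$

But

$$\frac{x^2 - x + 1}{x - 1} = x + \frac{1}{x - 1},$$

and therefore

$$\int \frac{x^2 - x + 1}{(x - 1)\sqrt{x^2 - x + 1}}\,dx = \int \frac{x\,dx}{\sqrt{x^2 - x + 1}} + \int \frac{dx}{(x - 1)\sqrt{x^2 - x + 1}}.$$

In accordance with (13):

$$\int \frac{x\,dx}{\sqrt{x^2 - x + 1}} = \alpha \sqrt{x^2 - x + 1} + \lambda \int \frac{dx}{\sqrt{x^2 - x + 1}}.$$

We differentiate this relationship and clear fractions, and obtain the
identity:

$$2x = \alpha(2x - 1) + 2\lambda,$$

whence

$$\alpha = 1; \quad \lambda = \frac{1}{2},$$

and therefore, by (11):

$$\int \frac{x\,dx}{\sqrt{x^2 - x + 1}} = \sqrt{x^2 - x + 1} + \frac{1}{2} \log\left(x - \frac{1}{2} + \sqrt{x^2 - x + 1}\right) + C.$$

On substituting

$$x - 1 = \frac{1}{t},$$

we obtain:

$$\int \frac{dx}{(x - 1)\sqrt{x^2 - x + 1}} = -\int \frac{dt}{\sqrt{t^2 + t + 1}} = -\log\left(t + \frac{1}{2} + \sqrt{t^2 + t + 1}\right) + C$$

$$= -\log\left(\frac{1}{x - 1} + \frac{1}{2} + \frac{\sqrt{x^2 - x + 1}}{x - 1}\right) + C = \log(x - 1) - \log\left(x + 1 + 2\sqrt{x^2 - x + 1}\right) + C.$$

Finally:

$$\int \frac{dx}{x - \sqrt{x^2 - x + 1}} = x + \sqrt{x^2 - x + 1} + 2 \log(x - 1) + \frac{1}{2} \log\left(x - \frac{1}{2} + \sqrt{x^2 - x + 1}\right)$$

$$- \log\left(x + 1 + 2\sqrt{x^2 - x + 1}\right) + C.$$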
Integral (8) is a particular case of an Abel integral, which has the form

$$\int R(x, y)\,dx, \tag{14}$$

where $R$ is a rational function of its arguments and $y$ is an algebraic function
of $x$, i.e. a function of $x$, defined by the equation

$$f(x, y) = 0, \tag{15}$$

where the left-hand side is an integral polynomial in $x$ and $y$. If

$$y = \sqrt{P(x)},$$

where $P(x)$ is a polynomial of the third or fourth degree in $x$, the Abel integral
(14) is called an elliptic integral. We discuss these latter in the third volume.
They are not generally expressible in terms of elementary functions, and the
same applies all the more for the general Abel integral. If the degree of the
polynomial $P(x)$ is higher than the fourth, integral (14) is called
hyperelliptic.

If (15), expressing $y$ as an algebraic function of $x$, has the property that
$x$ and $y$ can be expressed in the form of rational functions of an auxiliary
parameter $t$, integral (14) evidently reduces to the integral of a rational
fraction. In this case, the algebraic curve corresponding to (15) is referred
to as unicursal. In particular, Euler's substitutions serve to prove the
unicursality of the curve:

$$y^2 = ax^2 + bx + c.$$
200. Integrals of the form $\int R(\sin x, \cos x)\,dx$. Integrals of the form:

$$\int R(\sin x, \cos x)\,dx, \tag{16}$$

where $R$ is a rational function of its arguments, reduce to integrals
of rational fractions on introducing the new variable

$$t = \tan\frac{x}{2}.$$

We obtain, in fact, with the aid of well known trigonometric
formulae:

$$\sin x = \frac{2t}{1 + t^2}; \qquad \cos x = \frac{1 - t^2}{1 + t^2},$$

and in addition,

$$x = 2 \arctan t; \qquad dx = \frac{2\,dt}{1 + t^2},$$

from which our assertion follows at once.
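
The rational expressions for $\sin x$, $\cos x$ and $dx/dt$ in terms of $t = \tan\frac{x}{2}$ can be verified at sample points. The following is a minimal sketch; the function name and the sample values of $x$ are chosen here for illustration.

```python
import math


def half_angle_parametrisation(t):
    """sin x, cos x and dx/dt as rational functions of t = tan(x/2)."""
    denom = 1.0 + t * t
    return 2.0 * t / denom, (1.0 - t * t) / denom, 2.0 / denom


worst = 0.0
for x in (-2.0, -0.7, 0.3, 1.1, 2.9):
    s, c, _ = half_angle_parametrisation(math.tan(x / 2.0))
    worst = max(worst, abs(s - math.sin(x)), abs(c - math.cos(x)))
```

The substitution is usable only on an interval of $x$ that avoids the odd multiples of $\pi$, where $\tan\frac{x}{2}$ is not defined.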

We now notice some particular cases, where the working can be
simplified.

1. Suppose that $R(\sin x, \cos x)$ remains unchanged on replacing
$\sin x$ and $\cos x$ by $(-\sin x)$ and $(-\cos x)$ respectively, i.e. that
$R(\sin x, \cos x)$ has period $\pi$.

Since

$$\sin x = \cos x \tan x,$$

$R(\sin x, \cos x)$ is a rational function of $\cos x$ and $\tan x$, which remains
unchanged on replacing $\cos x$ by $(-\cos x)$, i.e. it contains only even
powers of $\cos x$:

$$R(\sin x, \cos x) = R_1(\cos^2 x, \tan x).$$

It is sufficient in the present case to substitute

$$t = \tan x,$$

in order to reduce (16) to the integral of a rational fraction.
With this:

$$dx = \frac{dt}{1 + t^2}; \qquad \cos^2 x = \frac{1}{1 + t^2}.$$

Thus, if $R(\sin x, \cos x)$ remains unchanged on replacing $\sin x$ and
$\cos x$ by $(-\sin x)$ and $(-\cos x)$ respectively, integral (16) reduces to
the integral of a rational fraction with the aid of the substitution $t = \tan x$.

2. We now suppose that $R(\sin x, \cos x)$ merely changes sign on
replacing $\sin x$ by $(-\sin x)$. The function

$$\frac{R(\sin x, \cos x)}{\sin x}$$

will be completely unchanged with this replacement, i.e. contains
only even powers of $\sin x$, so that:

$$R(\sin x, \cos x) = R_1(\sin^2 x, \cos x) \cdot \sin x.$$

On substituting $t = \cos x$, we get:

$$\int R(\sin x, \cos x)\,dx = -\int R_1(1 - t^2, t)\,dt,$$

i.e. if $R(\sin x, \cos x)$ merely changes sign on replacing $\sin x$ by $(-\sin x)$,
integral (16) reduces to the integral of a rational fraction by the
substitution $t = \cos x$.

3. It can easily be shown in the same way that if $R(\sin x, \cos x)$
merely changes sign on replacing $\cos x$ by $(-\cos x)$, (16) reduces to the
integral of a rational fraction by substituting $t = \sin x$.


201. Integrals of the form $\int e^{ax}[P(x) \cos bx + Q(x) \sin bx]\,dx$.
Integration by parts of an integral of the form:

$$\int e^{ax} \varphi(x)\,dx, \tag{17}$$

where $\varphi(x)$ is a polynomial of the $n$th degree in $x$, gives

$$\int e^{ax} \varphi(x)\,dx = \frac{1}{a} e^{ax} \varphi(x) - \frac{1}{a} \int e^{ax} \varphi'(x)\,dx.$$

Having thus separated out from the integral a term in the form of
a product of $e^{ax}$ and a polynomial of degree $n$, we are able to lower
the degree of the polynomial under the integral sign by unity. If we
continue to integrate by parts, whilst noting that

$$\int e^{ax}\,dx = \frac{1}{a} e^{ax} + C,$$

we obtain:

$$\int e^{ax} \varphi(x)\,dx = e^{ax} \psi(x) + C, \tag{18}$$

where $\psi(x)$ is a polynomial of the same degree $n$ as $\varphi(x)$, i.e. a product
of the exponential function $e^{ax}$ and a polynomial of degree $n$ retains the
same form on integration.

On differentiating (18) and cancelling $e^{ax}$ from both sides of the
identity obtained, we can find the coefficients of the polynomial
$\psi(x)$ by the method of undetermined coefficients.
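
Differentiating $e^{ax}\psi(x)$ gives the identity $a\psi(x) + \psi'(x) = \varphi(x)$, and repeated integration by parts leads to $\psi = \varphi/a - \varphi'/a^2 + \varphi''/a^3 - \ldots$. The sketch below evaluates this finite sum exactly; polynomials are stored as coefficient lists in ascending powers, and the function names are introduced here for illustration.

```python
from fractions import Fraction


def derivative(coeffs):
    """Derivative of a polynomial given as [c0, c1, c2, ...] (ascending powers)."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def psi_for(phi, a):
    """Polynomial psi with a*psi + psi' = phi, so (e^{ax} psi)' = e^{ax} phi."""
    a = Fraction(a)
    psi = [Fraction(0)] * len(phi)
    term, sign, power = [Fraction(c) for c in phi], 1, a
    while term:
        for k, c in enumerate(term):
            psi[k] += sign * c / power
        # Next summand: next derivative, opposite sign, one more power of a.
        term, sign, power = derivative(term), -sign, power * a
    return psi


psi = psi_for([0, 0, 1], 2)      # phi(x) = x^2, a = 2
```

For $\varphi(x) = x^2$ and $a = 2$ this gives $\psi(x) = \frac{1}{2}x^2 - \frac{1}{2}x + \frac{1}{4}$, and one verifies directly that $2\psi + \psi' = x^2$.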


We now consider the integral of more general type:

$$\int e^{ax}[P(x) \cos bx + Q(x) \sin bx]\,dx, \tag{19}$$

where $P(x)$ and $Q(x)$ are polynomials in $x$. Let $n$ be the greater of the
degrees of these polynomials. If we use the expedient of introducing
complex numbers, we can reduce integral (19) to integral (17) by
substituting for $\cos bx$ and $\sin bx$ in accordance with Euler's formula [176]:

$$\cos bx = \frac{e^{bxi} + e^{-bxi}}{2}; \qquad \sin bx = \frac{e^{bxi} - e^{-bxi}}{2i}.$$

This gives:

$$\int e^{ax}[P(x) \cos bx + Q(x) \sin bx]\,dx = \int e^{(a + bi)x} \varphi(x)\,dx + \int e^{(a - bi)x} \varphi_1(x)\,dx,$$

where $\varphi(x)$ and $\varphi_1(x)$ are polynomials of degrees not higher than $n$.
Application of formula (18) gives:

$$\int e^{ax}[P(x) \cos bx + Q(x) \sin bx]\,dx = e^{(a + bi)x} \psi(x) + e^{(a - bi)x} \psi_1(x) + C,$$

where $\psi(x)$ and $\psi_1(x)$ are polynomials of degrees not higher than $n$.
On substituting:

$$e^{\pm bxi} = \cos bx \pm i \sin bx,$$

we finally get:

$$\int e^{ax}[P(x) \cos bx + Q(x) \sin bx]\,dx = e^{ax}[R(x) \cos bx + S(x) \sin bx] + C, \tag{20}$$

where $R(x)$ and $S(x)$ are polynomials of degrees not higher than $n$.
Hence we see that the integrand in (19) retains the same form after
integration, with polynomials of degrees equal to the higher of the
degrees of the original polynomials.

We differentiate (20), cancel $e^{ax}$ in the identity obtained, and
equate coefficients of like terms of the form $x^s \cos bx$ and $x^s \sin bx$
$(s = 0, 1, 2, \ldots, n)$ on each side; we thus get a system of linear
equations defining the coefficients of the polynomials $R(x)$ and $S(x)$.
We remark here that, if either $\cos bx$ or $\sin bx$ is absent from the
integrand, both trigonometric functions still have to be written on
the right-hand side of the formula, bearing in mind the rule given
above concerning the degrees of polynomials $R(x)$ and $S(x)$.

Integrals of the form:

$$\int e^{ax} \varphi(x) \sin(a_1 x + b_1) \sin(a_2 x + b_2) \ldots \cos(c_1 x + d_1) \cos(c_2 x + d_2) \ldots dx$$

reduce at once to integrals of type (19).

In fact, by using the familiar trigonometric formulae that express 
the sums and differences of sines and cosines as products, we can 
conversely express the product of any two of the above trigonometric 
functions as a sum or difference of sines or cosines. Repeated appli- 
cation of this transformation enables us to reduce the number of 
trigonometric factors under the integral sign to one, and hence we 
obtain an integral of type (19). 

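Example. By (20):

$$\int e^{ax} \sin bx\,dx = e^{ax}(A \cos bx + B \sin bx) + C.$$

We differentiate and cancel $e^{ax}$:

$$\sin bx = (aA + bB) \cos bx + (-bA + aB) \sin bx,$$

whence

$$aA + bB = 0, \qquad -bA + aB = 1,$$

i.e.

$$A = -\frac{b}{a^2 + b^2}; \qquad B = \frac{a}{a^2 + b^2},$$

and finally:

$$\int e^{ax} \sin bx\,dx = e^{ax} \left(-\frac{b}{a^2 + b^2} \cos bx + \frac{a}{a^2 + b^2} \sin bx\right) + C. \tag{21}$$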
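
The coefficients $A = -b/(a^2 + b^2)$, $B = a/(a^2 + b^2)$ and the resulting formula for $\int e^{ax} \sin bx\,dx$ can be checked numerically. The sketch below substitutes them into the two linear equations $aA + bB = 0$, $-bA + aB = 1$ and compares a central difference quotient of $e^{ax}(A \cos bx + B \sin bx)$ with the integrand; the values of $a$, $b$, the sample points and the step $h$ are arbitrary choices.

```python
import math


def exp_sin_coefficients(a, b):
    """Solution (A, B) of a*A + b*B = 0, -b*A + a*B = 1."""
    d = a * a + b * b
    return -b / d, a / d


def antiderivative(x, a, b):
    A, B = exp_sin_coefficients(a, b)
    return math.exp(a * x) * (A * math.cos(b * x) + B * math.sin(b * x))


a, b = 0.5, 3.0
A, B = exp_sin_coefficients(a, b)
h = 1e-6
derivative_error = max(
    abs((antiderivative(x + h, a, b) - antiderivative(x - h, a, b)) / (2 * h)
        - math.exp(a * x) * math.sin(b * x))
    for x in (-1.0, 0.0, 0.8, 2.0))
```

The same check applies to any result obtained from (20): differentiate the right-hand side and compare with the integrand.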
Chapter I


3. (a) -2<x<4;(b0) -1<x<0;(0)<— 3,7% > 1; (d) v >0. 
4, —24; —6; 0; 0; 0; 6. 
5. 1; 5/4; VI + 25; |z | ^ V1 + z51Jy1- z. 





6. Tu 0. 

8. nad f has i. 
9. 0.4. 

10. + (w+|z)). 


11. (a) —1 < x < + œ; (b) ~œ < x < + co. 

12. (—co, —2), (—2, 2), (2, œ) ie. everywhere except x = + 2. 
13. (a) —e «z«- V3, fg«z«co; (b)x=0, |x| > /2. 
14. —1<x< 2. 

15.—2 <x <0. 

16. —œ < z < —1, 0<x<l 

17. -2<x< 2. 

P E 2<x<+ o. 

19, —4«2« 

20.1 < x < 100. 

21. kx < x < (r3) (k= 0, £1, +2, ...). 

22. D(x) = 2x* — 52? — 10; y(x) = —32* + 6x. 

23. Dly(2)] = 27; y[P(x)] = 2 


24. x. 
25. (x + 2). 
80. — 2/2; 0; 2/4. 
31. (a) y = 0 if x = —1, y > 0 ìf z > —l, y < 0 if z < —l; 
(b) y = 0 if x = 2 and if z = —1;y > 0 if -l<x<2y=<0 


if —œ© < z < —1 and if 2 <x < + œ; (c) y >0 if —œ < 
< t «oo; (d) y = 0 if x = 0, — 3 ory3 , y > 0 if — y3 < x < 0 
o O y<0if —o<r<-—3andif0<x< 
<3; (e) y=0 if 2=1;y>0 if —o<x<-—l and if 
l<r<o y<0ifOo<zx=<l. 
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32. (a) «= + (y — 3) or TE and x 


= y +1) (-1< ee) ea da 
< y < ©); E w= 2- ie p « y « +); (e) z—-j-tan y 
1 
33.z = yif —œ < y < œ; x = Yy if 0 < y < +o. 
34. (a) y =u", u = 2x — 5; (b) y = 2", u = cos z; (c) y = logy, Y, 
u = tan v, v = lac (d) y = are sin u, u = 3*, v = —a?. 
35. (a) y = sin? x; (b) y = arc tan Vlogige ; (c) y = 22? — 1) if 
leona 0 UN 
36. (a) y = —cos a?, Vx < x < /2x; (b) y = log,(10 — 10%), —oo < 
<a< 1; ()y = 3 aif —co « x « 0and y =xif0<x< +o. 
81. n > Ife. (a) n > 4; (b) n > 10; (c) n > 32. 
82. n >1/e — 1 = N. (a) N = 9; (b) N = 99; (c) N = 999. 
83. ô = e[5 (e < 1). (a) 0.02; (b) 0.002; (c) 0.0002. 
84. (a) log x < —N for 0 < x < ó(N); 
(b 2* > N for x > X(N); 
(c) | f(z) | > N for |x| > X(N). 
85. (a) 0; (b) 1; (c) 2; (d) 7/30. 
86. 1/2. 87. 1. 88. -3/2. 89. 1. 90. 3. 91. 1. 92. 3/4. 93. 1/3.
94. 0. 95. 0. 96. 1. 97. 0. 98. co. 99, 0. 100. 1. 
101. 1/2. 102. 3. 103. 4/3. 104. 1/9. 105. — 1/56. 106. 12. 107. 3/2. 
108, — 1/3 


109. 1. 110. 11. 


1 
ia" We 
116. 0.117. (a) ;- sin 2; (b) 0. 118. 3. 119. 5/2.120. 1/3. 121. x. 122. 1/2. 
123. cos a. 124. —sin a. 125. 0. 126. — 1//3. 127. 1. 128. e”?, 129. e?. 
130. o^. 131. 074. 132. e*.133. e. 140. (a) x £ > z+ ka, where k is 
an integer; (b) x # k n. 143. A = 4.144. (a) f(0) = n; (b) f(0) = 1/2; 

(c) (0) = 2; (d) f(0) = 2; (e) f(0) = 9; (f) f(0) = 1. 














Chapter II 
3 3 
1. (a) 3; (b) 0.21; (c) 2h + k. 2. (a) 0.1; (b) —3; (c) Ja + h — Ya. 
4. (a) 624; 1560; (b) 0.01; 100; (c) —1; 0.00011. 
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5. (a) a Ax; a; (b) 3a? Ax + 3x(Ax)? + (Ax)3; 322 + 32 Ax + (Ax)?; 











2a Az + (Ax)? 2z +4 
O =~ wea: (DV A2) — Ve; 
1 
Va 4- Y (z + 42) 
: Es a+de | 1 æ + Ax 
(e) 25 (24% — 1); 2* ; (f) log 24 a log - 


6. (a) —1; (b) 0.1; (c) A; 0. 7. 15 cm/sec. 
8.7.5. 9. e im f (x) . 10. jg ee f (x) =f (a) 
ae 


11, (a) 4p/4t; (b) d®/dt = lim 52 where O is the magnitude of 
4t>0 


Jisgiven. 


the angle of rotation at the instant t. 
12. (a) AT/At, (b) dT/dt = lim AT/At where T is the temperature at 
At-+0 
the time £. 13. dQ/dt = lim 4Q/4t where Q is the quantity of 


At+0 
matter at the time f. 


14. (a) 4m/Azx, (b) Em Am[Az. 15. (a) —1/6; (b) —5/21; (c) 50/201; 
+0 
y'(2) = —1/4. 
1 
16. sec^z. 17. (a) 32^; (b) — 2/2*; (c) ER (d) — cosec?z. 
18. 1/12. 19. f'(0) — —8, f'(1) — 0, f'(2) — 0. 
20. 30 m/sec. 21. —1; 2; tan  — 3. 22. (a) + œ; (b) co; (c) —1: 
' 1 , 1 
f' (ka + 3-2) = — 1, f (kao + 0=1. 
23. 52^ — 12a? + 2. 24. — + 2x — 2a?. 25. Qax + b. 
26. —15a2/a. 27. mat”? + (m + n) bt^*"—1, 28. bazë. 29. —afa?. 
30. 2318 — 5gt?12 — gy 4, 
Sya. 92, © tg? — 2 gan 
33. (bc — ad)/(c + dx)?. 34. —(22? + 6x — 25) (a? — 5x + 5). 
rae 
C O Nor. qu Scc t. 
35. (1 — 4z)/a?(2x — 1) Ve — yay 
37. 5 cos x — 3 sin x. 38. 4 cosec? 2x. 39. —2(sin x — cos z) ? 
40. ?? sin £. 41. 0. 42. cot x — x cosec? z. 43. arc sin x + z/J1— 2’. 
44. z arc tan x. 45. (x + 7) a5 e*. 46. xe”. 47. (x — 2) x? e. 
48. (5x* — a5) e^*. 49. e*(cos x — sin x). 50. a? e”. 
51, e” (arc sin g + 1//1 — a?) 52. z(21og x — 1)/log?z. 
33. 32? log x. 54. y = 2/x + logz[z? — 2[z?. 
55. Jac ?(ax + b)?. 56. 12ab + 185? y. 57. 16x(3 + 2a?)*. 
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c ba? 





eu 
. 99. -———. 60. 
ante gm us 


61. — lY(ajz — 1 
62. (1 — tan? x -+ tant x) sec? x. 63. — > cosec? z (cot 2) 
64. 2 — 15 cos? x sin x. 65. —16 cos 2 cosec? 2£. 66. sin z/(1 — 3 cosa)? 


67. sin? x/cost x. 68. 


58. 





—1/2 








= (3 cos x + 2 sin a)/V15 sin x — 10 cos x. 

















69. : cos ysin “U? g + 3 sin æ cos" * x. 
1 3 (arc sin 2) 
70. VA — a? 71. == 
0 va x*)(1 + arc sin 7). Ae ever dls ar il 
1 ze* -- e* + 1 2e* — 2log 2 
72, — . 48, — — . 74, 
(1 + 2) (arc tan x)? 2Yz oz 3y Go — FFI 
5 logt x 

HET 
75. 15 sin? 5a cos 5x cos? 1/3 — i sin? 5a cos 2/3 sin 2/3. 

4x 4-3 a +-da — 6 
T6. oe TES 
78, 27 (1 — a2)-5. 79, 2 4. 80. (a? + a2)92, 


x2 V 20? — 2x +1 
81. 2? (142) 
82,10 %(1 + ya)*. 83. a5(1 + x)?’ 84. (x — X) a+ 27 
85. 4x*(a — 22?) (a — 5a’). 














86. 2abmna" 1 (a + bay 87. at —1 88. 4 3x 
(a — ba?ynti (a 4- 2)8 2Va—x 
322 + 2 (a +b + ¢) x + ab + be + ca 1+2Vy 
89. . 90. — —. 
2 Y [(z + a) (w+ b) (v 4- c)] 6Vy(y + Vy)? 


91. 2(7t + 4) (3t + 23%. 92. (y — a) (2ay — y?) ?^. 93, (1 + e?) * *. 
94. sin? x cos? x. 95. (sin x cos x) ^. 96. 10 tan 5x sec? 5x. 
97. x cos 2?. 98. 3i? sin 20?. 99. 3 cos x cos 2x. 100. tant x 
i01. ^ 92, Den n. 
sin! yx 2 Y [a sin? + B cos? 2] 
103. 0. 104. 5 b es a2) "are sin(2 are cos x — are sin 2). 
105. dy (t — 1)7*?, 106, (1 + 22). 
107. (1 — 22) ?? aro cos x — YI — zê]. 108. (a — bz?) ^. 
109. (a — z)/(a +æ). 140. 2 ya? =x. 11 —a(2x — a?) 1^. 
112. arc sin x. 


5 1 sina 
113. . 11 OS pee < 115. ee ee Roa A Y 
Y (X — 262%) are sin 5% ay (1—log? a) 1 — 2a cos a + a? 


116. (5 + 4sin 2)71. 117. 4x y[x/(b— x)]. 118. sin? z/(1 + cos? x). 
119. i a e**, 120. sin 2x0 '"*, 121, 2m? p(2ma™ +b) ~ a™ log a. 
122. (α cos βt − β sin βt) e^{αt}.
123. e** sin f x. 124. e^* cos 3a. 125. a^^! a" (n — 227 log a). 
126. — x sin z cos !*y a/*95* (1+ log a Ycosz). 


127. 3x7? (sin i cot i log 3. 


128. (2ax + b)/(ax? + ba + c). 129. (a? + 2?) 1. 139. yzJ(1 + ya). 
131. (x? + 2az) !?. 132, —2z^11og-3 x. 133. —2? tan[(z — 1)/x]. 


3a? — 16z + 19 
134. (20 + 11/02 — £ — 2). 135. Gee a 


136. 1/(sin? z cosx). 137. yx? — a. 
— 62? 15a log? (ax+-b) 
138. (3—2 23) log (3 — 2 23) 139. (av4-b) ` 
140. 2(2? + a?) 1^. 141. (ma + n)/(a? — a?). 142. yZ sin log z. 
143. sin ? x. 144. z 1 V1 + a?. 145. (x + Y)/(x? — 1). 
146. 3//1— 92? [2% *^ ** log 2 + 2 (1 — arc cos 32)]. 


: b b b 
147, (3% 2x/cos Jog 3. | sin? ax[cos? bx) x a cos ax cos be + b sin az sin ba 





cos? bx 
148. (1 + 2 sin z)^ 1. 149. z^! (1 + m x) 
1 262 : —1 c 
Ih —a? - arcsin x Tos TOI =a x BE x (1+log? x) 


152. 1*/(x* + 3? — 2). ies: 2 cos? æ sin t? æ. 154. (2? — 3x)/(a* — 1). 
155. (1 + 23)-1. 156. (1 — 2?) ?? arc sin z. 

157. y' = 1 if x > 0, y' = −1 if x < 0; y'(0) does not exist. 158. |2x|.
159. 1/x. 160. f'(x) = −1, x < 0; f'(x) = −e^{−x}, x > 0.

161. 1 — x. 162, 2 + + (w — 3). 163. —1. 164. 0. 

170. (1 + 22) (1 + 3x) + 2(1 + 2) (1 + 32) + 3(x + 1) (1 + 22). 
171. —(a + 2) (ba? + 19% + 20) (£ + 1)7* (x + 3)-5 

172. ic — 4z + 2) [x (x — 1) (x — 22]. 


173. 5 (8a? + 5) a? (a? + 177%, 174, jg a LEES. 

175, z- E + + log 2). 176. 37 ae delega lods 2). 
Wie ane + cos z log 2). 

178. (cos x)*"* (cos æ log cos x — sin z - tan 2). 

179, P + ahs (+ il à: rnb 180.2 (2. 181. = i 


2 — [3 2 t+1 
A 184. 2. 4-2/6, 185. Epi 








186. tan à 187. an nee ula tan t. 
189. −tan(3t). 190. y' = −1 if t < 0, y' = +1 if t > 0. 191. −2e*.
192. tan t. 193. 1. 194. ∞. 195. No. 197. Yes. 198. 2/5. 199. −b²x/a²y.

200. −x²/y². 201. −x(3x + 2y)/(x² + 2y). 202. −y/x. 203. −(y/x)^{1/3}.

204. (1 − y³)/(1 + 3xy² + 4y³). 205. 10/(10 − 3 cos y). 206. −1.

207. y cos² y/(1 − x cos² y). 208. y(1 − x² − y²)/x(1 + x² + y²).

209. (£ + y). 210. (2 + y — 1)-1. 211. e" + y/o. 

212. y/(x — y). 213. (x + y)/(x — y). 

214. [cy + x√(x² + y²)]/[cx − y√(x² + y²)].

215. y(x log y − y)/x(y log x − x). 216. 45°; arc tan 2 ≈ 63°26′. 217. 45°.

218. arc tan(2/e) ~ 36°21’. 219. (0, 20); (1, 15); (—2, —12). 

220. y = x² − x + 1. 221. −1/11. 222. (1/8, −1/16).

223. At (4, 2) tangent is x − 4y + 4 = 0, normal is 4x + y = 18;
at (4, −2) tangent is x + 4y + 4 = 0, normal is 4x − y = 18.
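As a quick sanity check on answer 223, the sketch below confirms that each stated tangent and normal passes through its point of contact and that the two lines are perpendicular. The helper names `on_line` and `perpendicular` are illustrative and do not come from the text; the check makes no assumption about the curve itself.

```python
# Lines are stored as (A, B, C), meaning A*x + B*y + C = 0.
def on_line(line, point):
    """Return True if the point satisfies the line equation."""
    a, b, c = line
    x, y = point
    return a * x + b * y + c == 0


def perpendicular(l1, l2):
    """Two lines are perpendicular when their normal vectors are orthogonal."""
    return l1[0] * l2[0] + l1[1] * l2[1] == 0


# Answer 223: (tangent, normal) at each point of contact.
cases = {
    (4, 2): ((1, -4, 4), (4, 1, -18)),    # x - 4y + 4 = 0 and 4x + y = 18
    (4, -2): ((1, 4, 4), (4, -1, -18)),   # x + 4y + 4 = 0 and 4x - y = 18
}

results = {
    p: on_line(t, p) and on_line(n, p) and perpendicular(t, n)
    for p, (t, n) in cases.items()
}
print(results)
```

Integer coefficients keep the comparisons exact, so no floating-point tolerance is needed here.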

224. y − 5 = 0; x + 2 = 0. 225. x = 1, y = 0.

226. 7x − 10y + 6 = 0, 10x + 7y − 34 = 0.

227. y = 0, (x+ 4) + (x— 4)y— 4 yZ = 0. 

228. 5x + 6y − 13 = 0; 6x − 5y + 21 = 0. 229. x + y = 2.

230. At (1, 0) y = 2x − 2, x + 2y = 1; at (2, 0) x + y = 2, x − y = 2;
at (3, 0) y = 2x − 6, 2y + x = 3.

231. 14x — 13y + 12 = 0, 13x + 14y = 41. 

235. 40°36’. 236. At (0,0) the curves have common tangent; at (1,1) 
they intersect at an angle arc tan 1/7 ~ 8°8’. 

238. The subtangent and the subnormal are 2, the tangent and
normal 2√2.

239. 1/(log 2). 242. 3 cm/sec.; 0; —9 cm/sec. 243. 15 cm/sec. 

244. B moves towards 0 with velocity 1.5 cm/sec. 








245. (a) y = x tan α − ½ g (x/(v₀ cos α))²; (b) v₀² sin 2α/g;
(c) √(v₀² − 2v₀ g t sin α + g² t²), making an angle
tan⁻¹ [(v₀ sin α − gt)/(v₀ cos α)] with the x-axis.

246. −0.4 units per second. 247. (9/8, 9/2).

248. (a) 1.2 10 ~ 3.8 cm/sec; (b 40) cm/sec. 249. - x cm/sec. 

250. 56x⁶ + 210x⁴. 251. 2(2x² + 1) e^{x²}. 252. 2 cos 2x.

253. 2(1 − x²)/3(1 + x²)². 254. −x(a² + x²)^{−3/2}. 255. 2 arc tan x +
+ 2x/(1 + x²). 256. 2/(1 − x²) + 2x(1 − x²)^{−3/2} arc sin x.

261. y” = 6. 262. 4320. 263. 24(x + 1)^{−5}. 264. −64 sin 2x.

266.0; 1; 2; 2. 267. Speeds are 5; 4.997; 4.7; accelerations are 0; 
— 0.006; —0.6. 268. n! a”. 

269. (a) n!(1—2) €*?; (b) (—1)"" 1.3.5...(2n — 3) 27^ gt", 
270. (a) sin(x + ½nπ); (b) 2ⁿ cos(2x + ½nπ); (c) (−3)ⁿ e^{−3x};
(d) (−1)^{n−1} (n − 1)! (1 + x)^{−n}; (e) (−1)ⁿ n! (1 + x)^{−n−1};
(f) 2n! (1 − x)^{−n−1}; (g) 2^{n−1} sin[2x + ½(n − 1)π];
(h) (−1)^{n−1} (n − 1)! aⁿ (ax + b)^{−n}.
271. (a) xe” + ne*; (b) 2" e [2(—1)" 3? + 2n(—1)" ^! x + 
+ mía — 1) (—1)""7]; 
(c) (1 — 2?) eos (x +- an) — 2nz sin (x p (n — 1) a); 
(d) (—1)7 1.3.5. ... (2n — 3) 2" à "+ [e — (2n — 1)]; 
(e) (—1)" 6(n — 4) 1 a?7" if n > 4. 
272. (n — 1)! 273, (a) 925; (b) 2(Y + t$); (c) —y(1— 9). 
274. (a) —a 1 sin™? t; (b) (3a cost t sin t) 3; (c) — (4a sint —=- ys 
(d) —(at sin? t) 1. 
275. (a) 0; (b) 26°™. 276. (a) (1 + £) (1 + 32); (b) t(1-- t) (1 — 78. 
277. —2e"'(cos t + sin f)-?. 278. 1. 280. Δy = 0.009001; dy = 0.009.
281. 1. 282. dS = 2x Δx, ΔS = 2x Δx + (Δx)².
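Answer 282 contrasts the differential dS with the full increment ΔS. The sketch below illustrates the gap between them numerically, assuming (the problem statement is not reproduced here) that S = x² is the area of a square of side x. The function names and the sample values of x and Δx are chosen only for illustration.

```python
def increment(x, dx):
    """Exact change of S = x**2 when x grows by dx."""
    return (x + dx) ** 2 - x ** 2


def differential(x, dx):
    """Linear part of the change: dS = 2*x*dx."""
    return 2 * x * dx


x, dx = 3.0, 0.01   # sample values, assumed for illustration only
gap = increment(x, dx) - differential(x, dx)
print(gap, dx ** 2)  # the gap should agree with (dx)**2 up to rounding
```

Under that assumption the difference ΔS − dS equals (Δx)², which is why the differential is a good approximation for small Δx.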
285. At x = 0. 286. No! 287. −π/72 ≈ −0.0436.
289. (a) 0.485; (b) 0.965; (c) 1.2; (d) —0.045; (e) 0.81 radians. 
290. 565 cm³. 291. √5 ≈ 2.25; √17 ≈ 4.13; √70 ≈ 8.38; √640 ≈ 25.3.
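The four values in answer 291 are consistent with approximating a square root by the differential at the nearest perfect square, √n ≈ a + (n − a²)/(2a). That this is the intended method is an assumption, since the problem statement is not reproduced here; the function name `sqrt_by_differential` is illustrative.

```python
import math


def sqrt_by_differential(n):
    """Approximate sqrt(n) as a + (n - a*a) / (2*a), with a*a the nearest perfect square."""
    a = round(math.sqrt(n))          # integer whose square is closest to n
    return a + (n - a * a) / (2 * a)


approx = {n: sqrt_by_differential(n) for n in (5, 17, 70, 640)}
print(approx)
```

Rounded to the precision used in the answer key, these estimates reproduce the tabulated values.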


292. ∛10 ≈ 2.16; ∛70 ≈ 4.13; ∛200 ≈ 5.85.

293. (a) 5; (b) 1.1; (c) 0.93; (d) 0.9. 294. 1.0019. 

296. No; since f'(2) does not exist. 297. No; since f(π/2) does not exist.

300. −1/3. 301. ∞. 302. 3. 303. 1/2. 304. 2/7. 305. 1. 306. 0.

307. ∞ if n > 1; a if n = 1; 0 if n < 1. 308. 0. 309. 1. 310. 1/5.

311. (a) 0; (b) 0; (c) 2; (d) e*; (e), (f) the limit does not exist. 

312. The function is continuous. 

313. (a) x = 0, y = 0; (b) all points of the line y = x; (c) all points
on the circumference of the circle x² + y² = 1; (d) all points on
the coordinate axes.

314. ∂z/∂x = 3(x² − ay), ∂z/∂y = 3(y² − ax).

315. ∂z/∂x = 2y(x + y)^{−2}, ∂z/∂y = −2x(x + y)^{−2}.

316. ∂z/∂x = −y/x², ∂z/∂y = 1/x.

317. ∂z/∂x = x(x² − y²)^{−1/2}, ∂z/∂y = −y(x² − y²)^{−1/2}.

318. ∂z/∂x = y²(x² + y²)^{−3/2}, ∂z/∂y = −xy(x² + y²)^{−3/2}.

319. 92/03 = —(u? + y?) ^^, dafay = yl +y) [e + Mer. 

320. ∂z/∂x = −y/(x² + y²), ∂z/∂y = x/(x² + y²).

321. ∂z/∂x = yx^{y−1}, ∂z/∂y = x^y log x.

322. 0z/0x = —yx"? z, 0z]0y =u 12. 
323. 02/0x = xy? 22? — 249 ^ /[| y | (ut — y2)], 
d2/ay = —ya (24? — W?) E y | (at — y*)]. 
324. 92/04 = a cot [(z + a) y”*?], 
dz/ay = — + (x + a) y aH cot [(z + a) y 1^]. 
325, u, =y 277, uy = 0 - logs: 2y. 
326. u_x = y/(1 + xy), u_y = x/(1 + xy).
327. f, = 1/2, f, = 0. 328. f, = 1, fy = 1/2. 
329. Af = 44x + Ay + 242? + 24x Ay + Ax? Ay; df = 4dx + dy 
a) Af — df = 8; (b) Af — df = 0.062. 
331. dz = 3(x² − y) dx + 3(y² − x) dy.
332. dz = 2xy³ dx + 3x²y² dy.
333. dz = 4(x² + y²)^{−2} (xy² dx − x²y dy).
334. dz = sin 2x dx − sin 2y dy.
335. dz = y² x^{y−1} dx + x^y (1 + y log x) dy.
336. dz = 2(a? + y?) Uedx + ydy). 337. df = (x + y) (dx — «dyly). 
338. dl = 0.062 cm, Δl = 0.065 cm.
339. (a) 1.00; (b) 4.998; (c) 0.273. 
340. z(a g — B1)/g Vig. 341. e'(t log t — 1)/(t log? t). 
342. y 1? t(6 — 2/2y?) cot(y”*” x). 
343. 2é(log t) (tan £) + (t + £7?) tan t + sec? ¿(1 + t?) log t. 
344. 0. 345. (sin (cos x cot x — sin x log sin 2). 


Chapter III 


1. 2 a? a? + e. 2. 223 + 4a? + Sx + c. 


M. e l (a + ii a? + -5 aba? + c. 
4. a? x + aba + — peers 5. 2a) (2px)/3 + c 
um 
n 


3 UE Lo 7. (na* 


J 9 j I 1 
8. ate — - 5. at qiu cum — -y S +. 





2 | 3 f / / 
9. = 0 +c. 10. 13 gr 2 aria? e. 





Qx2M+4 , — 4gmtnià 9g2n-à 
m41 ' m+ In 41 moore 
in x j DA zia aa 
12, 207? 91? — 4a: + 40? a9? — 992 + £9 7x7. 


11. 





x —y10 
z+ yio 





13. eae tan + c. 
yi y 





log |+ e 


hee 
15. log[x + /4 F] 4- C. 16. are sin 7 +0. 


17. arc sin 77 —log [x + y (34-2) J+<e. 

18. tan x − x + c. 19. x − coth x + c. 20. (3e)^x/(log 3 + 1). 21. b − a.
22. v₀T − ½gT². 23. 3. 24. (2¹⁰ − 1)/log 2. 25. 156. 26. log x.

27. — V (1-23). 28. 2x0 * — e7*'. 29. i z^? cos x + z-? cos (271). 
30.2 = n m,n = 0, 1, 2, .... 31. >. 32. log 2. 33. (p + 1). 34. 7/3. 
35. 100/3. 36. 7/4. 37. 16/3. 38. — 2/3. 39. + log 2/3. 40. log 9/8. 

41. 35-1. — 32 log 3. 42. arc tan 3 — arc tan 2 = arc tan 1/7. 
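The identity in answer 42, arc tan 3 − arc tan 2 = arc tan 1/7, follows from the subtraction formula tan(a − b) = (tan a − tan b)/(1 + tan a tan b). A minimal numerical check, using only the standard `math` module:

```python
import math

# Left and right sides of the identity in answer 42.
lhs = math.atan(3) - math.atan(2)
rhs = math.atan(1 / 7)

# Subtraction formula with tan a = 3 and tan b = 2.
tan_diff = (3 - 2) / (1 + 3 * 2)

print(lhs, rhs, tan_diff)
```

Both angles lie in the first quadrant and their difference is positive and less than π/2, so taking the arc tangent of the ratio introduces no branch ambiguity.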

43. log 4/3. 

44. 2/16. 45. 1 — 1/3. 46. a log |c / (a—z) |. 47. x + log | 2x + 1| +c. 
48. —204 Lig [3 + 2r | - c. 49. E — Flog|a + bz| + o. 
50, Ly MA log Las B| +e. 

BL. 3a? + x + 21og |x — 1|. 52,74% + 2x + log|a+3|+c. 

53. a Ena 2x + 3logg|z — 1| 4- c. 

94. a? x + 2ablog | x — a | — P?/(x — a) + c. 


95. log |a + 1| + (x + 1)71. 56. SE d 57. — 4 (a — ba^. 


59. + log |a? — 5| + c. Geb (ung 3) +c. 

61. zr! (a? x? 4- b?) ta arc tan Á + c. 62. = are sin 57 +e. 
63. “are tan 2? + c. 64. 1. y log |? - y«* —1 |+ c. 
65. 7 y (are tan + aJ : 66.2 (are sin z)?/?. 67. (—a/m) e^"* + c. 
68.0 — 4?-?*/(3 log 4). 69. e* +e*+c. 
70. i a(e*** _ g73xla) 1 2w + c. 
71. (log a — log b)71 (a* b * — a7* b”) — +c. 


a?*!2 — 4x e+?) 
72. Toga? t gest * duda riot 


74 Leb gg 15. TBs rd X41 ¢, 
77. log|e* — 1| + c. 78.0 — + Z- (a — be" vn 79. C —4 (6 in 








80. C + flog |at — 4x + 1|. a ge tan e. 82. O— se e 
3 3 1 = 

83. IE arc tan 18 =% log [2/3 + V2 4-32? ] - c. 
84. Jepe La 2 log |z 4-1] 4- c. 
85. rone +. 86. log|z + did +e. 
Si z [log | sec 3z--tan 3z | + B is} +e. 
88. C — (log z)—1. 89. log | tan x + /tantx—2 | + c. 

1 sin x 
90. y arc tan(x /2) + € — TFI 91. C + a®"*/log a. 
92. 4 (a3 + 1} + c. 93. are sin (2?) + c. 
94. qeu) — g +o. 

1 


95. — La— sing+ c. 96. aro sin (- tana) + o. 


97. a log |tan ES E 47) |+ c. 98. — 2 = (1 + log a)? 4 c. 
99. C — 2 log | cos Va — 1 |. 

100. > log | tan (538) | +. 

101, emetanx | 7 log? (1 + a?) + arc tan g% -- c. 

102. C − log |sin x + cos x|.

103. C — /2log| tan -7z | — 22 — /2 cos 4 c. 

104, z+ Flog = “A T 

105. s Jæ | 4- 20 tan x + C. 106, e^ * y c. 


107. 7 arc sin 218 y, V(4— 3x2) + c. 108. x — log (1 + €) +e. 


1 a—b 
109. Ya arc tan ¡Ez æ+ c. 
110. mem Ve* — 2). 111. il log | tan az |+ c. 
112. C — -2— cos ins + Ø). 113. 4 log] (2+ log z)/(2 — log x)| + c. 
114. C — i [OM COS > zy. 
115. C — e7 MX 116. > are sin bs sin? z) + c. 117. c—2 cot 2x. 


118. s 1 (arc sin x)? — V1 — a? + c. 

119, ing [sec x + Vsec? x + 1] + c. 
1 Y5 + sin 2a 

120. log — i 

2 ne OB y singe 5 





y2 y2 


121. zl aro tan bs tan z) +c. 122. o [log (x + Va? + 1}}3? 4 c. 
123. (a) C + 1/2 are cos /2/x, if a > 2; (b) C — log (1 + e””); 
(c) € + Ja (Sa? — 3)8; (d) (e + 192 — Ao + 1) 4 c. 
(e) log [sin x + /1 + sin? x] + €; 
124, [(2x + 5/12 + 5(20 + 5)3/11] +c. 
125. 2 (> 292 +4 + 2 /x— 21og|1 + /x| +): 
126. log |(y2x + 1—1)/(y2z + 1 + 1) + c. 
127. 2 arc tan /e* — 1. 128. log x — log 2 log | log x + 2 log 2 | + c. 
129. i (are sin x)? + c. 130. 5 (© — 2) (+ 1)? + o. 
131. : (cos? æ — 5) cos + c. 132. log x — log [1 + 2? +1] + c. 
133. 0 — +e y1 = a+ jare sin 2. 
134, 0 — 4 at V2 — a? — 2 2 — e. 
135. x? — a? — a arc cos a/z + c. 
136. arc cos 1/x + c if x > 0 and arc cos(−1/x) + c if x < 0.


137. yz? + 1 — log [1 + yz? + 1] — log x + c. 

138. C — i- x y4— a. 139, I yi — a? + i arc sin x + c, 
140. 2 arc sin √x + c. 141. x log x − x + c.

142. x arc tan x − ½ log(1 + x²) + c.

143. x arc sin x + √(1 − x²) + c. 144. C + sin x − x cos x.

145. C + ⅓x sin 3x + (1/9) cos 3x.

146. C − (x + 1) e^{−x}. 147. C − (x log 2 + 1)/(2^x log² 2).

148. 5 (02? — 6x + 2) 0% + c. 149. € — (2? + 5) 07. 

150. C — 30 Pad + Oa? + 54a + 162). 

151. C — qe cos 2a + i sin 2g. 








152. lu + 10% + 11) sin 2a + + (2m + 5) cos 2x + c. 

153, $2 logz—5 +c. 154. x log? x — 2x log x + 2x + c. 

155. 0 — la^ logz pa. 156. 2 Vx logz— 4 yz -+ c. 

157. i (2? + 1) arc tan ae +c. 

158. Pec arc sin x — Faro sin z + qe yi> r? + c. 

159. x log[x + √(1 + x²)] − √(1 + x²) + c. 160. C − x cot x + log|sin x|.
1 l 1 

161. y are tan 3; -F 1)-4 c. 162, log] ssl +c. 
—1 

163. ne Ui + i T€ 
2 —— 

164. + log (a2 — Tx + 13) + vs RO yi 4 c. 
165. Ž log (a? — 4x + 5) + 4 are tan(x — 2). 
166. z— $log (2? + 38+ 4)+ aro tan m +e. 
167. x y 3 log pu — T 10) + 8 arc tan (x — 3) + c. 
LM c. 169. arc sin(2z — 1) + c. 














168. y erosin = 
PUERUM 
171. zie o. 172. x + 3log| z —3| — 31og| z — 2| + c. 








iJ 3)3 (x 4 (x — 4)5 
173. 3 log| EID -- c. 174. log TER +e. 





175. Be + loge + gon (z — 4) — $ log (s — 1) + e. 

176. (1 + a)! +log| 5 yI p 

177. a > log (20— Ue 9 log(2 + 1) + log x +c. 
11 

178. ¿8 CN GE 4 c. 

179.0 — 2 (w — 87 — 4 (e+ ae 


27 u—5 
180.0 + 0657 Dehn + 343 wi adr | 


181. o Fete 2)7?, 182. a + log|ar| — + log (a? + 1) + c. 





o Jog 








183. tele m Rd 


24-1 
—3|- gglog |2 — + g log (a? + 4a -+ 5) + 


is => arctan (x +2) + c. 185, sin g — y sin’ x te. 





Il E 
186. A ate = cos’ g, 187. + sin? g — + sina + c. 


188. q coss Z — 3 cost = g 6 


189. + sin? — e log| sin z | + c. 
3 l . lo. £ 1 

190. -~v — y Sin 2a + -zy sin 4a + c. 191. — — 37 33 sin 4% + c. 
1 1 i uos 

192. tg LE" R 22 + c. 


193. Fee + y EXE Lau RUE 
194. C — cot x — E cot?z. 195. tan x + i tan? + + tang + c. 
196. C — = cotta — < cot z. 197. C + tang + i tan? x — 2 cot 2x. 
198. + tanta + 3 log|tan z| — EA cot? g — + cot! a + c. 
199. sec? > x+2 log | tan + +e. 
200. e 2 [log] tan 3-2 |+10g tan ( (x2 4)]*« 
201. o- nr cos & ed — -g 008 t cosec? x -+ 108] tan 2 +e. 
202. 36 sin 4c sect 4x + 3 sin 4z sec? 4g -+ 
+53 os tan | Qe + ED i M 
204. C — i cot? x — log|sin x|. 205.0 — + cot? g + cot a+ a. 
206. i tan? E + tans —- — 3 tan A 8log | eos i-e | +a+e. 
l ius + sin 22? + c. 
208. C — i cost/? x; + i- cost? q — E cost g c. 


209. 2 Vtan z + c. 
































210. T log (2? + /22 + 1)/(2 — y22 + ee tan mats + 
+ c, where z= ud 211, C — ig 908 8c +4 cos 2x. 
212. C — jy sin 26% + ¡y y sin 5x. 213, A 

IRA ta 215, ¿7 sin 2az + 2 cos 2b + c. 
1 1. 
216. -g Y cos p — 4. Sin (20x + 9). 
217. ur sin x +3 39 sin 52 t sin Tx + c. 
1 °, 1 
* 24 16 8 
1 tan 3 2—2 
219. -, leg m MUS +e. 
tan 142 
220. 750% tan (Er) |+ e. 221. z— tan—-z4e. 
222. C — x + tan x + sec 2. 
tany 2 — 5 
223. log ——(— —- | + e. 224. are tan (1 + tan 2) + c. 
tan 2 —3 


12 


225, 3 2 — 5 log] 2 sina + 3 cos z| + c. 
226. C nomi 227. + aro tan (+ tan ? 

228. -> (v 1) V3 = 2z af + 2 arc sin (x + 1 1) + 

009.1 a FEB poa PPA. 

230. —- £ fa? +9 — -> log[z + Ve? +9] + c. 

231. 5- (z — 1) a? — 2x + 2 + > log [s—1 + Va? — 204 2]. 

232. - a Vn? — & — 2 log [a+ — 4 | + c. 

233. C + -- (22: + 1) V F æ — + log] 224-1 + 2 Va? +a. 

234. 2. 235. Divergent. 236. 1/(1 − p) if p < 1; divergent if p ≥ 1.


237. Divergent. 238. x. 239. Divergent. 240. 1. 


241. 1/(p − 1) if p > 1; divergent if p ≤ 1.
242. n. 243. 2//5. 244. Divergent. 245. Divergent. 246. 1/log 2. 


247. Divergent. 248. 1/log a. 249. Divergent. 250. 1/k. 251. + n. 








. Convergent. 


256. Divergent. 257. Convergent. 258. Convergent. 259. Convergent. 
260. Divergent. 261. Convergent. 264. 32/3. 265. 1. 266. 1/2. 267. 17/4. 


268. 2. 269. log 2. 270. .-$- ph. 274. 9/2. 
275. 32/3. 276. 4. 277.-5- a?. 278. 15 x. 279. + mab. 280. 37 a. 
281. x(0? + 2ab). 282. 6x a?. 283.5 a2, 284. È x a. 285. "s na? 
286. a?. 287. — xa^. 288.71. 289.2 (14 — 8/2 a? 
290. a 1 — ep. 
291. a [7-2 + 5 V8). 292. a y2. 293. -57 = (10 TO — 1). 
294. |/2 + log (1 + 1» ). 295. 1+ e? — y — 1+ log [J14- e?—1] + 
+ log (VZ + 1). 296.1 + Slog’. 297.log[e + Ye*—1]. 
298. log(2 + 3). 299.4 (1 + e). 300. 4 /3a. 301. 5 aT”. 
302, 4(a3 — b8)/ab. 303. 16a. 304. xa JIF 42 + + alog[ 2a + 
+ VIF 47]. 
305. 8a. 306. 2a[y2 Host +2)]- 307, 7 a5/30. 308, $ x ab?. 
309. È 2. 310. (a) Z; (0). 311. 2 na. 312. 37/10. 








313. > (15 — 16 log 2) 2 aè. 314. 27? a*. 315. + z keH. 
316. 15 zh? a. 318. (a) 52? a3; (b) 62? a?; (c) T n a? (92? — 16). 
4 8 2/9,2 
319. 105 n a8. 320. 57 na?. 321. 7 a. 322. x abh(1 + R2/3c?). 


323. $ mabe. 324. 22 [VZ + log (1+-//2)] 
325. a (V5 — /2) + zlog 2(/2 + 1) — wlog (1 + y5). 
326. 27 [/2 + log (1 + y2)]. 
327. + nae? + e73 + 4) = > z a*(2 + sinh 2). 
329. + z (e — 1) (è + e + 4). 330. 42? ab. 
331. o 27 b? + 2x ab &71 are sin e; 

(2) 27 a? + n b? e1llog(1 + e) —log(1 — e) ], where e=(a?—b3)18/a. 





2 





332. (a) E z a; (b) 16 a? a?; (c) que 333. LE na’. 
334. M, ny mae — ia yF b. 
335. M, = i ab, M= bab. 
-_-_1 
336. M, = =M, =5 Si=j=34 
337. M, = M, = Fas 3 = Y = UM 338. 27 0. 
339. x = (a sin a/a), y = 0. 340. z = na, Y= <a. 


341.3 = 4a[32; y = 4b/37. 342. 3 = y = 9/20. 348.2 = xa, y=— a 


344. (o; 0:3 a). 345. (o; 0; 7h). 346. (o, 0,3 a). 347. na. 
348. I, = + b°, I, = ab. 349. hb. 

350. És = y mabe, I, = Lab. 351. gy aei icq 

352. 3p z r* h o (o = density). 353. 2 zm 

354. V = 22? a? b, S=4n® ab 355. 5. a. 


Chapter IV 


1 1 z E m 
Lot 2e $5427 a 8. (n + 2) (n + 17 


6. 2n/(3n + 2). 

7. 1Un(n + 1). 8. 1.3.5...(2n — 1)/1.4.7...(3n — 2). 9. (—1)"7t. 
10. n". 11. Divergent. 12. Convergent. 13. Divergent. 14. Divergent. 
15. Divergent. 16. Divergent. 17. Divergent. 18. Divergent. 19. Divergent.
20. Convergent. 21. Convergent. 22. Convergent. 23. Convergent.
24. Convergent. 25. Convergent. 26. Convergent.
27. Convergent. 28. Convergent. 29. Divergent. 30. Divergent.

31. Convergent. 32. Divergent. 33. Convergent. 34. Convergent. 

35. Convergent. 36. Divergent. 37. Convergent. 38. Divergent. 

39. Convergent. 40. Divergent. 41. Divergent. 42. Convergent. 

43. Divergent. 44. Convergent. 45. Divergent. 46. Convergent. 

47. Divergent. 48. Convergent. 49. Divergent. 50. Convergent. 

51. Convergent. 52. Convergent. 53. Divergent. 54. Divergent. 

55. Conditionally convergent. 56. Conditionally convergent. 

57. Absolutely convergent. 58. Divergent. 59. Conditionally convergent.

60. Absolutely convergent. 61. Conditionally convergent. 

62. Absolutely convergent. 63. Absolutely convergent. 64. Divergent. 

65. Absolutely convergent. 66. Conditionally convergent. 

67. Divergent. 68. Absolutely convergent. 69. Absolutely convergent. 

70. Conditionally convergent. 71. Divergent. 72. Absolutely convergent.

73. Absolutely convergent. 75. Yes. 76. Convergent.

s| < b R,<0,R;>0. 








T8. En < irl + PUITS 
79. Ra < (n + 2)/(n + 1) (n + 1)! ; Rio < 3x107*. 
Rc dd n —2 <x < 2. 82 —l1<x<l. 


83. — 750 oe 84. —1<zx<1. 85 —1<x<l. 
86. −1 < x < 1. 87. −∞ < x < ∞. 88. x = 0 only.
89, —co < y < co. 90. —4 «2 «4.91. — - «2 « 4- 


92. —2 < x < 2. 93. —e < x < e. 94. |z| < 1. 95. 
i | «3.97. |z] < y2. 98. 2 = 0. 99. |z| < co. 
100. |z| < 1/2. 104. −log(1 − x), (−1 < x < 1).

105. log(1 + x), (−1 < x < 1). 106. ½ log[(1 + x)/(1 − x)], (|x| < 1).
107. Arc tan x, (|x| < 1). 108. (x − 1)^{−2}, (|x| < 1).

109. (1 − x²)(1 + x²)^{−2}, (|x| < 1). 110. 2(1 − x)^{−3}, (|x| < 1).
111. x(x − 1)^{−2}, (|x| > 1).


1 
112. 5 

















1 l—z 
[arc tan z — 318 3 (|« | « 1). 


1 
113. = x /3. 114. 3. 


115. −78 + 59(x + 4) − 14(x + 4)² + (x + 4)³, (−∞ < x < ∞).
116. 5x³ − 4x² − 3x + 2 + (15x² − 8x − 3) h +
+ (15x − 4) h² + 5h³ (−∞ < x < ∞, −∞ < h < ∞).
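An expansion in powers of (x + 4), as in answer 115, can be checked by multiplying it back out into ordinary powers of x. The sketch below does this with plain coefficient lists; the helper names are illustrative. The problem statement is not reproduced here, so the polynomial it recovers, x³ − 2x² − 5x − 2, is an inference from the answer rather than a quotation.

```python
def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists (lowest degree first)."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def expand_shifted(coeffs, shift):
    """Expand sum(c_k * (x + shift)**k) into ordinary powers of x."""
    result = [0]
    power = [1]                       # (x + shift)**0
    for c in coeffs:
        term = [c * a for a in power]
        size = max(len(result), len(term))
        # Pad the shorter list with zeros and add coefficient by coefficient.
        result = [
            (result[i] if i < len(result) else 0)
            + (term[i] if i < len(term) else 0)
            for i in range(size)
        ]
        power = poly_mul(power, [shift, 1])   # next power of (x + shift)
    return result


# Coefficients of (x + 4)**0 .. (x + 4)**3 taken from answer 115.
expanded = expand_shifted([-78, 59, -14, 1], 4)
print(expanded)   # coefficients of 1, x, x**2, x**3
```

The constant term of the expansion, −78, is the value of the recovered polynomial at x = −4, which gives a second independent check.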
117. S (1) (a — 1)"/n, (0 < æ < 2). 
a 

118. or (c — 1)", (0 <2 < 2). 

119. 7S (n + 1) (æ+ 1)", (-2 <z « 0). 
n=0 


120, es xem (|| <co). 
n=l i 








gS ae E Bx. (0 € z « 8). 
122. X (— y eua (e| < eo). 

123, ya aaa e (~œ «z« o) 

124,14 TAG a", (e| « co). 

125, 8 + 3 EIEEEI gh, (|| < en) 
e SW ie | (lol <8). 


Chapter V 


1. 5/3; −2. 2. (y² − x²)/2xy, (x² − y²)/2xy, (y² − x²)/2xy, 2xy/(x² − y²).
3. f(x, 2) = 1 + x — 2. 4. 2=R(1 — BR). 5. f(z) = y(Y + 3/2. 
. : (a? — ay). 7. flu) = u? + 2u; z= x — 1 + y. 
8. f(y) = V1 + ys z = Ya? + y. 
9. (a) The interior and circumference of the circle x² + y² = 1;
(b) z only exists along the line y = x; (c) the half-plane x + y > 0;
(d) the strip −1 < y < 1; (e) the square |x| < 1, |y| < 1;
(f) |y| ≤ |x|; (g) |x| ≥ 2, |y| ≤ 2; (h) the interior of the
annulus bounded by r = a, r = √2 a; (i) the regions
2nπ < x < (2n + 1)π in the half-plane y ≥ 0 and the regions
(2n − 1)π < x < 2nπ in the half-plane y < 0; (j) x² + y > 0;
(k) the whole xy-plane; (l) the whole xy-plane except (0, 0);
(m) x > 0, y > √x; (n) the whole xy-plane except the lines x = 1
and y = 0; (o) the interior of the rings 2nπ < x² + y² < (2n + 1)π
(n = 0, 1, 2, ...).
10. u_x = yz(xy)^{z−1}, u_y = xz(xy)^{z−1}, u_z = (xy)^z log(xy).
11. u_x = yz^{xy} log z, u_y = xz^{xy} log z, u_z = xyz^{xy−1}.
12. u_x = 3(x² − yz), u_y = 3(y² − xz), u_z = 3(z² − xy).

13. u, = 2zy? t/(Y — 2), uy = 22? p(l — 22), u, = 22? y at/(1 — 2), 
My = P P — 22). 

14. u_x = −xr^{−3}, u_y = −yr^{−3}, u_z = −zr^{−3}.

15. vu, = y'z[(x + yy (2 + 1), uy = gale + yY (2 + 1), u = 
=xyl(e + y) (z + 1). 

16. f_x = 1, f_y = 1/2, f_z = 1/2. 17. r² sin θ dr dθ dφ.

28. ∂z/∂x = −y/(x² + y²), dz/dx = 1/(1 + x²).


29. 02/0% = ya" ^, dzjdz = a” [y (x) logg + 2] 


30. 02/02 = 2z Cf/au + ye” of[0v, 0z[0y = — 2y Əffðu + xe” df/dv. 
31. ∂z/∂u = 0, ∂z/∂v = 1.
1 ? 1 
32. 02/0x = y (1 £s =| f (u), O2/Oy = (2 a =) F (u), (u = ay + Y) 
36. 922/04? — abcy?/(bta? + a*y?)®?, 9022/0: dy = — 
— abexy/(bta? + ary2??, 922/0y? — abca?/(bta? + atya. 
37. 022/07? = 2 (y — x*)/(x? + y}, 0?2/07 09 = — 22/(2? + y)?, 
Belay? — —1/(x* + yy. 
38. 07/0x dy = xy/(2xy + y”. 
44. z = 0 (min.) at (1, 0). 45. No maximum or minimum.
46. z = −1 (min.) at (1, 0). 47. z = 108 (max.) at (3, 2).
48. z = −8 (min.) at (√2, −√2) and at (−√2, √2).
49, 2 = a (max.) at. (+a//3, +9//3) 


E v (min.) at (+ a//3, F B//3). 


50. z = 1 (max.) at (0, 0). 51. z = 0 (min.) at (0, 0). 

52. z = √3 (max.) at (1, −1).

53. (a) The two functions z = 3 ± √(25 − (x − 1)² − (y + 2)²) are
defined by the equation. The (+) sign gives a function which has
a maximum value 8 at the point (1, −2); this function takes
its minimum value 3 at all points on the circumference of the
circle (x − 1)² + (y + 2)² = 25, the function not being defined at
points outside this circle. The (−) sign gives a function which
has a minimum value −2 at (1, −2); this function takes its
maximum value 3 at points on (x − 1)² + (y + 2)² = 25, not
being defined at points outside the circle.
(b) One function has a maximum (z = −2) at (−1, 2) while
the other has a minimum (z = 1) at (−1, 2); both functions reach
their other bounding value at points on the curve 4x³ − 4y²
− 12x + 16y − 33 = 0.
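The extreme values quoted in part (a) of answer 53 can be checked numerically by sampling both branches z = 3 ± √(25 − (x − 1)² − (y + 2)²) over the disc on which they are defined. This is only a grid sketch, not a proof; the function name `branches` and the grid spacing of 0.1 are choices made for illustration.

```python
import math


def branches(x, y):
    """Return (upper, lower) values of z = 3 ± sqrt(25 - (x-1)^2 - (y+2)^2)."""
    # max(..., 0.0) guards against tiny negative values from rounding on the boundary.
    root = math.sqrt(max(25 - (x - 1) ** 2 - (y + 2) ** 2, 0.0))
    return 3 + root, 3 - root


# Offsets in tenths, restricted to the disc of radius 5 centred at (1, -2).
grid = [
    (1 + i / 10, -2 + j / 10)
    for i in range(-50, 51)
    for j in range(-50, 51)
    if i * i + j * j <= 2500
]

upper_max = max(grid, key=lambda p: branches(*p)[0])
lower_min = min(grid, key=lambda p: branches(*p)[1])
print(upper_max, branches(*upper_max)[0])   # location and value of the largest upper-branch sample
print(lower_min, branches(*lower_min)[1])   # location and value of the smallest lower-branch sample
```

On the boundary circle the square root vanishes, so both branches take the common value 3 there, in line with the answer.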
54. z has max. value 1/4 at (1/2, 1/2).
55. z has max. value 5 at (1, 2) and min. value −5 at (−1, −2).
56. z has min. value 36/13 at (18/13, 12/13).


57.2 has max. value 1 + = y2 at ES + kx, zt + kz}, 
š 3 
and min. value 1 — = y2 at ES + kx, m + ka). 





8 
58. u has min. value −9 at (−1, 2, −2) and max. value 9 at
(1, −2, 2).

59. u has maximum value a² at (±a, 0, 0) and minimum value c² at
(0, 0, ±c).

60. The cube with edge V5, 


61. The box has a square base of edge (2V)13 and has height (+ yp 


62. An equilateral triangle of side + p. 63. Cube. 
64. The triangle with sides 3p/4, 3p/4, p/2 rotated about the
smallest side. 


65. P is at the centre of mass pa VE d 3, ma E Teta d Tto), 
1 2 3 1 2 3 





66. — + + + a = 8. 
67. If the ellipsoid has semi-axes a, b, c the rectangular parallelepiped
should have edges 2a/√3, 2b/√3, 2c/√3.